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NUCLEIC ACID SEQUENCES TO PROTEINS INVOLVED IN TOCOPHEROL 

SYNTHESIS 

INTRODUCTION 

TECHNICAL FIELD 
The present invention is directed to nucleic acid and amino acid sequences and 
constructs, and methods related thereto. 

BACKGROUND 

Isoprenoids are ubiquitous compounds found in all living organisms. Plants synthesize a 
diverse array of greater than 22,000 isoprenoids (Connolly and Hill (1992) Dictionary of 
Terpenoids, Chapman and Hall, New York, NY). In plants, isoprenoids play essential roles in 
particular cell functions such as production of sterols, contributing to eukaryotic membrane 
architecture, acyclic polyprenoids found in the side chain of ubiquinone and plastoquinone, 
growth regulators like abscisic acid, gibberellins, brassinosteroids or the photosynthetic pigments 
chlorophylls and carotenoids. Although the physiological role of other plant isoprenoids is less 
evident, like that of the vast array of secondary metabolites, some are known to play key roles 
mediating the adaptative responses to different environmental challenges. In spite of the 
remarkable diversity of structure and function, all isoprenoids originate from a single metabolic 
precursor, isopentenyl diphosphate (IPP) (Wright, (1961) Annu. Rev. Biochem. 20:525-548; and 
Spurgeon and Porter, (1981) in Biosynthesis of Isoprenoid Compounds .. Porter and Spurgeon eds 
(John Wiley, New York) Vol. 1, ppl-46). 

A number of unique and interconnected biochemical pathways derived from the 
isoprenoid pathway leading to secondary metabolites, including tocopherols, exist in chloroplasts 
of higher plants. Tocopherols not only perform vital functions in plants, but are also important 
from mammalian nutritional perspectives. In plastids, tocopherols account for up to 40% of the 
total quinone pool. 
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Tocopherols and tocotrienols (unsaturated tocopherol derivatives) are well known 
antioxidants, and play an important role in protecting cells from free radical damage, and in the 
prevention of many diseases, including cardiac disease, cancer, cataracts, retinopathy, 
Alzheimer's disease, and neurodegeneration, and have been shown to have beneficial effects on 
symptoms of arthritis, and in anti-aging. Vitamin E is used in chicken feed for improving the 
shelf life, appearance, flavor, and oxidative stability of meat, and to transfer tocols from feed to 
eggs. Vitamin E has been shown to be essential for normal reproduction, improves overall 
performance, and enhances immunocompetence in livestock animals. Vitamin E supplement in 
animal feed also imparts oxidative stability to milk products. 

The demand for natural tocopherols as supplements has been steadily growing at a rate of 
1 0-20% for the past three years. At present, the demand exceeds the supply for natural 
tocopherols, which are known to be more biopotent than racemic mixtures of synthetically 
produced tocopherols. Naturally occurring tocopherols are all d-stereomers, whereas synthetic a- 
tocopherol is a mixture of eight </, /-a-tocopherol isomers, only one of which (12.5%) is identical 
to the natural rf-a-tocopherol. Natural ^-a-tocopherol has the highest vitamin E activity (1.49 
IU/mg) when compared to other natural tocopherols or tocotrienols. The synthetic cc-tocopherol 
has a vitamin E activity of 1 . 1 IU/mg. In 1 995, the worldwide market for raw refined 
tocopherols was $1020 million; synthetic materials comprised 85-88% of the market, the 
remaining 12-15% being natural materials. The best sources of natural tocopherols and 
tocotrienols are vegetable oils and grain products. Currently, most of the natural Vitamin E is 
produced from y-tocopherol derived from soy oil processing, which is subsequently converted to 
a-tocopherol by chemical modification (a-tocopherol exhibits the greatest biological activity). 

Methods of enhancing the levels of tocopherols and tocotrienols in plants, especially levels 
of the more desirable compounds that can be used directly, without chemical modification, would be 
useful to the art as such molecules exhibit better functionality and biovailability. 

In addition, methods for the increased production of other isoprenoid derived compounds in 
a host plant cell is desirable. Furthermore, methods for the production of particular isoprenoid 
compounds in a host plant cell is also needed. 
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SUMMARY OF THE INVENTION 

The present invention is directed to sequences to proteins involved in tocopherol 
synthesis. The polynucleotides and polypeptides of the present invention include those derived 
from prokaryotic and eukaryotic sources. 

Thus, one aspect of the present invention relates to prenyltransferase, and in particular to 
isolated polynucleotide sequences encoding prenyltransferase proteins and polypeptides related 
thereto. In particular, isolated nucleic acid sequences encoding prenyltransferase proteins from 
bacterial and plant sources are provided. 

In another aspect, the present invention provides isolated polynucleotide sequences 
encoding tocopherol cyclase, and polypeptides related thereto. In particular, isolated nucleic acid 
sequences encoding tocopherol cyclase proteins from bacterial and plant sources are provided. 

Another aspect of the present invention relates to oligonucleotides which include partial 
or complete prenyltransferase or tocopherol cyclase encoding sequences. 

It is also an aspect of the present invention to provide recombinant DNA constructs which 
can be used for transcription or transcription and translation (expression) of prenyltransferase or 
tocopherol cyclase. In particular, constructs are provided which are capable of transcription or 
transcription and translation in host cells. 

In another aspect of the present invention, methods are provided for production of 
prenyltransferase or tocopherol cyclase in a host cell or progeny thereof. In particular, host cells 
are transformed or transfected with a DNA construct which can be used for transcription or 
transcription and translation of prenyltransferase or tocopherol cyclase. The recombinant cells 
which contain prenyltransferase or tocopherol cyclase are also part of the present invention. 

In a further aspect, the present invention relates to methods of using polynucleotide and 
polypeptide sequences to modify the tocopherol content of host cells, particularly in host plant 
cells. Plant cells having such a modified tocopherol content are also contemplated herein. 
Methods and cells in which both prenyltransferase and tocopherol cyclase are expressed in a host 
cell are also part of the present invention. 

The modified plants, seeds and oils obtained by the expression of the prenyltransferase or 
tocopherol cyclase are also considered part of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides an amino acid sequence alignment between ATPT2, ATPT3, ATPT4, 
5 ATPT8, and ATPT12 are performed using ClustalW. 

Figure 2 provides a schematic picture of the expression construct pCGN 10800. 
Figure 3 provides a schematic picture of the expression construct pCGN 10801. 
Figure 4 provides a schematic picture of the expression construct pCGN10803. 
Figure 5 provides a schematic picture of the construct pCGN10806. 
1 0 Figure 6 provides a schematic picture of the construct pCGN 10807. 

□ Figure 7 provides a schematic picture of the construct pCGN10808. 

!S Figure 8 provides a schematic picture of the expression construct pCGN10809. 

* » H 

ffl Figure 9 provides a schematic picture of the expression construct pCGN10810. 

f i r.-H-.»| 
« r* 

□ Figure 10 provides a schematic picture of the expression construct pCGN1081 1. 

- - ii^*** 

^ 5 Figure 1 1 provides a schematic picture of the expression construct pCGN10812. 

r IHn| 

Figure 1 2 provides a schematic picture of the expression construct pCGN 1 08 1 3 . 
:;3 Figure 13 provides a schematic picture of the expression construct pCGN10814. 

: p Figure 1 4 provides a schematic picture of the expression construct pCGN 10815, 

Figure 15 provides a schematic picture of the expression construct pCGN10816. 

i n ■* 

2 0 Figure 1 6 provides a schematic picture of the expression construct pCGNl 08 1 7. 

Figure 17 provides a schematic picture of the expression construct pCGN10819. 
Figure 18 provides a schematic picture of the expression construct pCGN 10824. 
Figure 19 provides a schematic picture of the expression construct pCGN10825. 
Figure 20 provides a schematic picture of the expression construct pCGN10826. 
2 5 Figure 21 provides an amino acid sequence alignment using ClustalW between the 

* Synechocystis prenyltransf erase sequences. 

Figure 22 provides an amino acid sequence of the ATPT2, ATPT3, ATPT4, ATPT8, and 
ATPT12 protein sequences from Arabidopsis and the slrl736 5 slr0926, slll899 ? slr0056, and the 
slrl518 amino acid sequences from Synechocystis. 
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Figure 23 provides the results of the enzymatic assay from preparations of wild type 
Synechocystis strain 6803, and Synechocystis sir 1736 knockout. 

Figure 24 provides bar graphs of HPLC data obtained from seed extracts of transgenic 
Arabidopsis containing pCGNl 0822, which provides of the expression of the ATPT2 sequence, 
5 in the sense orientation, from the napin promoter. Provided are graphs for alpha, gamma, and 
delta tocopherols, as well as total tocopherol for 22 transformed lines, as well as a 
nontransformed (wildtype) control. 

Figure 25 provides a bar graph of HPLC analysis of seed extracts from Arabidopsis plants 
transformed with pCGN10803 (35S-ATPT2, in the antisense orientation), pCGN10822 (line 
1 0 1625, napin ATPT2 in the sense orientation), pCGN10809 (line 1627, 35S-ATPT3 in the sense 
* orientation), a nontransformed (wt) control, and an empty vector transformed control. 

1 Figure 26 shows total tocopherol levels measured in T# Arabidopsis seed of line. 
3 Figure 27 shows total tocopherol levels measured in T# Arabidopsis seed of line. 

3 Figure 28 shows total tocopherol levels measured in developing canola seed of line 

i5 10822-1. 

n 
#i 

44 

•w 

Figure 29: shows results of phytyl prenyltransferase activity assay using Synechocystis 
% wild type and slrl 737 knockout mutant membrane preparations. 

I Figure 30 is the chromatograph from an HPLC analysis of Synechocystis extracts, 

j Figure 3 1 is a sequence alignment of the Arabidopsis homologue with the sequence of the 

.4 

'2 0 public database. 

Figure 32 shows the results of hydropathic analysis of slrl 737 

Figure 33 shows the results of hydropathic analysis of the Arabidopsis homologue of 
slrl 737. 

* 

Figure 34 shows the catalytic mechanism of various cyclase enzymes 

2 5 ' Figure 35 is a sequence alignment of slrl 737, slrl 737 Arabidopsis homologue and the 

Arabidopsis chalcone isomerase. 



DETAILED DESCRIPTION OF THE INVENTION 
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The present invention provides, inter alia, compositions and methods for altering (for 
example, increasing and decreasing) the tocopherol levels and/or modulating their ratios in host 
cells. In particular, the present invention provides polynucleotides, polypeptides, and methods of 
use thereof for the modulation of tocopherol content in host plant cells. 

The biosynthesis of a-tocopherol in higher plants involves condensation of homogentisic 
acid and phytylpyrophosphate to form 2-methyl-6 phytylbenzoquinol that can, by cyclization and 
subsequent methylations (Fiedler et al, 1982, Planta, 155: 511-515, Soil et al, 1980, Arch. 
Biochem. Biophys. 204: 544-550, Marshall et al., 1985 Phytochem., 24: 1705-171 1, all of which 
are herein incorporated by reference in their entirety), form various tocopherols. 

The Arabidopsis pds2 mutant identified and characterized by Norris et al. (1995), is 
deficient in tocopherol and plastiquinone-9 accumulation. Further genetic and biochemical 
analysis suggested that the protein encoded by PDS2 may be responsible for the prenylation of 
homogentisic acid. The PDS2 locus identified by Norris et al (1 995) has been hypothesized to 
possibly encode the tocopherol phytyl-prenyltransferase, as the pds2 mutant fails to accumulate 
tocopherols. 

Norris et al. (1995) determined that in Arabidopsis pds2 lies at the top of chromosome 3, 
approximately 7 centimorgans above long hypocotyl2, based on the genetic map. ATPT2 is 
located on chromosome 2 between 36 and 41 centimorgans, lying on BAC F19F24, indicating 
that ATPT2 does not correspond to PDS2. Thus, it is an aspect of the present invention to 
provide novel polynucleotides and polypeptides involved in the prenylation of homogentisic acid. 
This reaction may be a rate limiting step in tocopherol biosynthesis, and this gene has yet to be 
isolated. 

U.S. Patent No. 5,432,069 describes the partial purification and characterization of 
tocopherol cyclase from Chlorella protothecoides, Dunaliella salina and wheat. The cyclase 
described as being glycine rich, water soluble and with a predicted MW of 48-50kDa. However, 
only limited peptide fragment sequences were available. 

In one aspect, the present invention provides polynucleotide and polypeptide sequences 
involved in the prenylation of straight chain and aromatic compounds. Straight chain 
prenyltransferases as used herein comprises sequences which encode proteins involved in the 
prenylation of straight chain compounds, including, but not limited to, geranyl geranyl 
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pyrophosphate and famesyl pyrophosphate. Aromatic prenyltransferases, as used herein, 
comprises sequences which encode proteins involved in the prenylation of aromatic compounds, 
including, but not limited to, menaquinone, ubiquinone, chlorophyll, and homogentisic acid. The 
prenyltransferase of the present invention preferably prenylates homogentisic acid. 

In another aspect, the invention provides polynucleotide and polypeptide sequences to 
tocopherol cyclization enzymes. The 2,3-dimethyl-5-phvtylplastoquinol cyclase (tocopherol 
cyclase) is responsible for the cyclization of 2,3-dimethyl-5-phytylplastoquinol to tocopherol 

Isolated Polynucleotides, Proteins, and Polypeptides 

A first aspect of the present invention relates to isolated prenyltransferase 
polynucleotides. Another aspect of the present invention relates to isolated tocopherol cyclase 
polynucleotides. The polynucleotide sequences of the present invention include isolated 
polynucleotides that encode the polypeptides of the invention having a deduced amino acid 
sequence selected from the group of sequences set forth in the Sequence Listing and to other 
polynucleotide sequences closely related to such sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire length to each 
coding sequence as set forth in the Sequence Listing. The invention also provides the coding 
sequence for the mature polypeptide or a fragment thereof, as well as the coding sequence for the 
mature polypeptide or a fragment thereof in a reading frame with other coding sequences, such as 
those encoding a leader or secretory sequence, a pre-, pro-, or prepro- protein sequence. The 
polynucleotide can also include non-coding sequences, including for example, but not limited to, 
non-coding 5 ' and 3' sequences, such as the transcribed, untranslated sequences, termination 
signals, ribosome binding sites, sequences that stabilize mRNA, introns, polyadenylation signals, 
and additional coding sequence that encodes additional amino acids. For example, a marker 
sequence can be included to facilitate the purification of the fused polypeptide. Polynucleotides 
of the present invention also include polynucleotides comprising a structural gene and the 
naturally associated sequences that control gene expression. 

The invention also includes polynucleotides of the formula: 

X-(Rl)n-(R2)"(R 3 )n-Y 



7 



Attorney Docket No: 
52015/01/US 

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, Ri and R 3 are 
any nucleic acid residue, n is an integer between 1 and 3000, preferably between 1 and 1000 and 
R 2 is a nucleic acid sequence of the invention, particularly a nucleic acid sequence selected from 
the group set forth in the Sequence Listing and preferably those of SEQ ID NOs: 1, 3, 5, 7, 8, 10, 
5 11, 13-16, 18, 23, 29, 36, and 38. In the formula, R 2 is oriented so that its 5' end residue is at the 
left, bound to R u and its 3' end residue is at the right, bound to R 3 . Any stretch of nucleic acid 
residues denoted by either R group, where R is greater than 1 , may be either a heteropolymer or a 
homopolymer, preferably a heteropolymer. 

The invention also relates to variants of the polynucleotides described herein that encode 
1 0 for variants of the polypeptides of the invention. Variants that are fragments of the 
n polynucleotides of the invention can be used to synthesize full-length polynucleotides of the 

invention. Preferred embodiments are polynucleotides encoding polypeptide variants wherein 5 
;;n to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide sequence of the invention are 
^ substituted, added or deleted, in any combination. Particularly preferred are substitutions, 
;| 5 additions, and deletions that are silent such that they do not alter the properties or activities of the 
polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 70% 
identical over their entire length to a polynucleotide encoding a polypeptide of the invention, and 
□ polynucleotides that are complementary to such polynucleotides. More preferable are 
2*0 polynucleotides that comprise a region that is at least 80% identical over its entire length to a 
polynucleotide encoding a polypeptide of the invention and polynucleotides that are 
complementary thereto. In this regard, polynucleotides at least 90% identical over their entire 
length are particularly preferred, those at least 95% identical are especially preferred. Further, 
those with at least 97% identity are highly preferred and those with at least 98% and 99% identity 

2 5 are particularly highly preferred, with those at least 99% being the most highly preferred. 

Preferred embodiments are polynucleotides that encode polypeptides that retain 
substantially the same biological function or activity as the mature polypeptides encoded by the 
polynucleotides set forth in the Sequence Listing. 

The invention further relates to polynucleotides that hybridize to the above-described 

3 0 sequences. In particular, the invention relates to polynucleotides that hybridize under stringent 
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conditions to the above-described polynucleotides. As used herein, the terms "stringent 
conditions" and "stringent hybridization conditions" mean that hybridization will generally occur 
if there is at least 95% and preferably at least 97% identity between the sequences. An example 
of stringent hybridization conditions is overnight incubation at 42°C in a solution comprising 
5 50% formamide, 5x SSC (150 mM NaCl, 1 5 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 micrograms/milliliter denatured, 
sheared salmon sperm DNA, followed by washing the hybridization support in 0.1 x SSC at 
approximately 65°C. Other hybridization and wash conditions are well known and are 
exemplified in Sambrook, et ah, Molecular Cloning: A Laboratory Manual, Second Edition, cold 
1 0 Spring Harbor, NY (1 989), particularly Chapter 1 1 . 

y*\ The invention also provides a polynucleotide consisting essentially of a polynucleotide 

* ' *!"!? 

+ b >-!■■> 

]j£ sequence obtainable by screening an appropriate library containing the complete gene for a 

F bl 

m polynucleotide sequence set for in the Sequence Listing under stringent hybridization conditions 

*» H 

with a probe having the sequence of said polynucleotide sequence or a fragment thereof; and 
;f 5 isolating said polynucleotide sequence. Fragments useful for obtaining such a polynucleotide 

include, for example, probes and primers as described herein. 
q As discussed herein regarding polynucleotide assays of the invention, for example, 

^ polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 
□ genomic DNA to isolate full length cDNAs or genomic clones encoding a polypeptide and to 
20 isolate cDNA or genomic clones of other genes that have a high sequence similarity to a 

polynucleotide set forth in the Sequence Listing. Such probes will generally comprise at least 15 
bases. Preferably such probes will have at least 30 bases and can have at least 50 bases. 
Particularly preferred probes will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
2 5 sequence set forth in the Sequence Listing may be isolated by screening using a DNA sequence 
provided in the Sequence Listing to synthesize an oligonucleotide probe. A labeled 
oligonucleotide having a sequence complementary to that of a gene of the invention is then used 
to screen a library of cDNA, genomic DNA or mRNA to identify members of the library which 
hybridize to the probe. For example, synthetic oligonucleotides are prepared which correspond 
30 to the prenyltransferase or tocopherol cyclase EST sequences. The oligonucleotides are used as 
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primers in polymerase chain reaction (PCR) techniques to obtain 5' and 3' terminal sequence of 
prenyltransferase or tocopherol cyclase genes. Alternatively, where oligonucleotides of low 
degeneracy can be prepared from particular prenyltransferase or tocopherol cyclase peptides, 
such probes may be used directly to screen gene libraries for prenyltransferase or tocopherol 
5 cyclase gene sequences. In particular, screening of cDNA libraries in phage vectors is useful in 
such methods due to lower levels of background hybridization. 

Typically, a prenyltransferase or tocopherol cyclase sequence obtainable from the use of 
nucleic acid probes will show 60-70% sequence identity between the target prenyltransferase or 
tocopherol cyclase sequence and the encoding sequence used as a probe. However, lengthy 
1 0 sequences with as little as 50-60% sequence identity may also be obtained. The nucleic acid 
. ^ probes may be a lengthy fragment of the nucleic acid sequence, or may also be a shorter, 

i 

■u oligonucleotide probe. When longer nucleic acid fragments are employed as probes (greater than 

H '> F> 

» ** tl 

1 * +1 

m about 100 bp), one may screen at lower stringencies in order to obtain sequences from the target 

M f "1*1 
» «< F> 

sample which have 20-50% deviation (i.e., 50-80% sequence homology) from the sequences 
••§5 used as probe. Oligonucleotide probes can be considerably shorter than the entire nucleic acid 

* t n 

sequence encoding an prenyltransferase or tocopherol cyclase enzyme, but should be at least 
: !« s about 10, preferably at least about 15, and more preferably at least about 20 nucleotides. A 

higher degree of sequence identity is desired when shorter regions are used as opposed to longer 

■ 

□ regions. It may thus be desirable to identify regions of highly conserved amino acid sequence to 
"20 design oligonucleotide probes for detecting and recovering other related prenyltransferase or 
tocopherol cyclase genes. Shorter probes are often particularly useful for polymerase chain 
reactions (PCR), especially when highly conserved sequences can be identified. (See, Gould, et 
ah, PNAS USA (1989) 36:1934-1938.). 

Another aspect of the present invention relates to prenyltransferase or tocopherol cyclase 

2 5 polypeptides. Such polypeptides include isolated polypeptides set forth in the Sequence Listing, 

as well as polypeptides and fragments thereof, particularly those polypeptides which exhibit 
prenyltransferase or tocopherol cyclase activity and also those polypeptides which have at least 
50%o, 60% or 70% identity, preferably at least 80% identity, more preferably at least 90% 
identity, and most preferably at least 95% identity to a polypeptide sequence selected from the 

3 0 group of sequences set forth in the Sequence Listing, and also include portions of such 
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polypeptides, wherein such portion of the polypeptide preferably includes at least 30 amino acids 
and more preferably includes at least 50 amino acids. 

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing the 
5 sequences. In the art, "identity" also means the degree of sequence relatedness between 

polypeptide or polynucleotide sequences, as determined by the match between strings of such 
sequences. "Identity" can be readily calculated by known methods including, but not limited to, 
those described in Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, 
New York (1988); Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
1 0 Academic Press, New York, 1 993; Computer Analysis of Sequence Data, Part I, Griffin, A.M. 
ri and Griffin, H.G., eds., Humana Press, New Jersey (1994); Sequence Analysis in Molecular 

Biology, von Heinje, G., Academic Press (1987); Sequence Analysis Primer, Gribskov, M. and 

* + 1 ml 

m Devereux, J., eds., Stockton Press, New York.(1991); and Carillo, H., and Lipman, D., SIAM J 
^ Applied Math, 48: 1073 (1988). Methods to determine identity are designed to give the largest 
; | 5 match between the sequences tested. Moreover, methods to determine identity are codified in 

1 F *i 

publicly available programs. Computer programs which can be used to determine identity 
j 3 between two sequences include, but are not limited to, GCG (Devereux, J., et al., Nucleic Acids 
i™ Research 12(1):387 (1984); suite of five BLAST programs, three designed for nucleotide 

r 

i w ^-"i * 

Q sequences queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence 
"10 queries (BLASTP and TBLASTN) (Coulson, Trends in Biotechnology, 12: 76-80 (1994); Birren, 
et al, Genome Analysis, 1: 543-559 (1997)). The BLAST X program is publicly available from 
NCBI and other sources {BLAST Manual, Altschul, S., et al, NCBI NLM NiH, Bethesda, MD 
20894; Altschul, S., et al, J. Mol. Biol, 215:403-410 (1990)). The well known Smith Waterman 
algorithm can also be used to determine identity. 

2 5 Parameters for polypeptide sequence comparison typically include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 
Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. Acad. Sci 
USA 89:10915-10919(1992) 
Gap Penalty: 1 2 

3 0 Gap Length Penalty: 4 
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A program which can be used with these parameters is publicly available as the "gap" 
program from Genetics Computer Group, Madison Wisconsin. The above parameters along with 
no penalty for end gap are the default parameters for peptide comparisons. 

Parameters for polynucleotide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 

Comparison matrix: matches = +10; mismatches = 0 

Gap Penalty: 50 

Gap Length Penalty: 3 

A program which can be used with these parameters is publicly available as the "gap" 
program from Genetics Computer Group, Madison Wisconsin. The above parameters are the 
default parameters for nucleic acid comparisons. 

The invention also includes polypeptides of the formula: 

X-(R0n-(R2)-(R 3 )n-Y 

wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is hydrogen or a 
metal, Ri and R3 are any amino acid residue, n is an integer between 1 and 1000, and R2 is an 
amino acid sequence of the invention, particularly an amino acid sequence selected from the 
group set forth in the Sequence Listing and preferably those encoded by the sequences provided 
in SEQ ID NOs: 2, 4, 6, % 12, 17, 19-22, 24-28, 30, 32-35, 37, and 39. In the formula, R 2 is 
oriented so that its amino terminal residue is at the left, bound to R i? and its carboxy terminal 
residue is at the right, bound to R 3 . Any stretch of amino acid residues denoted by either R 
group, where R is greater than 1, may be either a heteropolymer or a homopolymer, preferably a 
heteropolymer. 

Polypeptides of the present invention include isolated polypeptides encoded by a 
polynucleotide comprising a sequence selected from the group of a sequence contained in the 
Sequence Listing set forth herein . 

The polypeptides of the present invention can be mature protein or can be part of a fusion 

protein. 

Fragments and variants of the polypeptides are also considered to be a part of the 
invention. A fragment is a variant polypeptide which has an amino acid sequence that is entirely 
the same as part but not all of the amino acid sequence of the previously described polypeptides. 
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The fragments can be "free-standing" or comprised within a larger polypeptide of which the 
fragment forms a part or a region, most preferably as a single continuous region. Preferred 
fragments are biologically active fragments which are those fragments that mediate activities of 
the polypeptides of the invention, including those with similar activity or improved activity or 
5 with a decreased activity. Also included are those fragments that antigenic or immunogenic in an 
animal, particularly a human. 

Variants of the polypeptide also include polypeptides that vary from the sequences set 
forth in the Sequence Listing by conservative amino acid substitutions, substitution of a residue 
by another with like characteristics. In general, such substitutions are among Ala, Val, Leu and 
1 0 He; between Ser and Thr; between Asp and Glu; between Asn and Gin; between Lys and Arg; or 
between Phe and Tyr. Particularly preferred are variants in which 5 to 10; 1 to 5; 1 to 3 or one 

*■—»>■ 

• H 

if amino acid(s) are substituted, deleted, or added, in any combination. 

■+ »■ 

0 Variants that are fragments of the polypeptides of the invention can be used to produce 

, 

«■ 

*i i-i 

rt 

, ,IIW| 

*3 the corresponding full length polypeptide by peptide synthesis. Therefore, these variants can be 

HIV 

tf 

Jl 5 used as intermediates for producing the full-length polypeptides of the invention. 

The polynucleotides and polypeptides of the invention can be used, for example, in the 
3 transformation of host cells, such as plant host cells, as further discussed herein. 
"1 The invention also provides polynucleotides that encode a polypeptide that is a mature 

1 k^Vn 

i 

1 

3 protein plus additional amino or carboxyl-terminal amino acids, or amino acids within the mature 
2 0 polypeptide (for example, when the mature form of the protein has more than one polypeptide 
chain). Such sequences can, for example, play a role in the processing of a protein from a 
precursor to a mature form, allow protein transport, shorten or lengthen protein half-life, or 
facilitate manipulation of the protein in assays or production. It is contemplated that cellular 
enzymes can be used to remove any additional amino acids from the mature protein. 

2 5 A precursor protein, having the mature form of the polypeptide fused to one or more 

prosequences may be an inactive form of the polypeptide. The inactive precursors generally are 
activated when the prosequences are removed. Some or all of the prosequences may be removed 
prior to activation. Such precursor protein are generally called proproteins. 

3 0 Plant Constructs and Methods of Use 

13 
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Of particular interest is the use of the nucleotide sequences in recombinant DNA 
constructs to direct the transcription or transcription and translation (expression) of the 
prenyltransferase or tocopherol cyclase sequences of the present invention in a host plant cell. 
5 The expression constructs generally comprise a promoter functional in a host plant cell operably 
linked to a nucleic acid sequence encoding a prenyltransferase or tocopherol cyclase of the 
present invention and a transcriptional termination region functional in a host plant cell. 

A first nucleic acid sequence is "operably linked" or "operably associated" with a second 
nucleic acid sequence when the sequences are so arranged that the first nucleic acid sequence 
1 0 affects the function of the second nucleic-acid sequence. Preferably, the two sequences are part 
its% of a single contiguous nucleic acid molecule and more preferably are adjacent. For example, a 

i r M ■ 'rt 

• :0 promoter is operably linked to a gene if the promoter regulates or mediates transcription of the 

v t n M 

m gene in a cell. 

^ Those skilled in the art will recognize that there are a number of promoters which are 

; B 5 functional in plant cells, and have been described in the literature. Chloroplast and plastid 
specific promoters, chloroplast or plastid functional promoters, and chloroplast or plastid 

■ * >H"^ 
T T PT 

!:« a operable promoters are also envisioned. 

jl * One set of plant functional promoters are constitutive promoters such as the CaMV35S or 

□ FMV35S promoters that yield high levels of expression in most plant organs. Enhanced or 

F 

20 duplicated versions of the CaMV35S and FMV35S promoters are useful in the practice of this 
invention (Odell, et al (1985) Nature 313:810-812; Rogers, U.S. Patent Number 5,378, 619). In 
addition, it may also be preferred to bring about expression of the prenyltransferase or tocopherol 
cyclase gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed, fruit, etc., and 
the promoter chosen should have the desired tissue and developmental specificity. 

25 Of particular interest is the expression of the nucleic acid sequences of the present 

invention from transcription initiation regions which are preferentially expressed in a plant seed 
tissue. Examples of such seed preferential transcription initiation sequences include those 
sequences derived from sequences encoding plant storage protein genes or from genes involved 
in fatty acid biosynthesis in oilseeds. Examples of such promoters include the 5' regulatory 

30 regions from such genes as napin (Kridl et al, Seed Set Res. 7:209:219 (1991)), phaseolin, zein, 
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soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase, soybean a' subunit of p-conglycinin 
(soy 7s ? (Chen et al, Proc. Natl Acad Scl 9 83:8560-8564 (1986))) and oleosin. 

It may be advantageous to direct the localization of proteins conferring prenyltransferase 
or tocopherol cyclase to a particular subcellular compartment, for example, to the mitochondrion, 
endoplasmic reticulum, vacuoles, chloroplast or other plastidic compartment. For example, 
where the genes of interest of the present invention will be targeted to plastids, such as 
chloroplasts, for expression, the constructs will also employ the use of sequences to direct the 
gene to the plastid. Such sequences are referred to herein as chloroplast transit peptides (CTP) or 
plastid transit peptides (PTP). In this manner, where the gene of interest is not directly inserted 
into the plastid, the expression construct will additionally contain a gene encoding a transit 
peptide to direct the gene of interest to the plastid. The chloroplast transit peptides may be 
derived from the gene of interest, or may be derived from a heterologous sequence having a CTP. 
Such transit peptides are known in the art. See, for example, Von Heijne et ah (1991) Plant Mol 
Biol Rep. 9:104-126; Clarke* ah (1989) J. Biol Chem. 2tf¥:17544-17550; della-Cioppa et ah 
(1987) Plant Physiol <W:965-968; Romer et ah (1993) Biochem. Biophys. Res Commun. 
796:1414-1421; and, Shah et ah (1986) Science 253:478-481. 

Depending upon the intended use, the constructs may contain the nucleic acid sequence 
which encodes the entire prenyltransferase or tocopherol cyclase protein, or a portion thereof. 
For example, where antisense inhibition of a given prenyltransferase or tocopherol cyclase 
protein is desired, the entire prenyltransferase or tocopherol cyclase sequence is not required. 
Furthermore, where prenyltransferase or tocopherol cyclase sequences used in constructs are 
intended for use as probes, it may be advantageous to prepare constructs containing only a 
particular portion of a prenyltransferase or tocopherol cyclase encoding sequence, for example a 
sequence which is discovered to encode a highly conserved prenyltransferase or tocopherol 
cyclase region. 

The skilled artisan will recognize that there are various methods for the inhibition of 
expression of endogenous sequences in a host cell Such methods include, but are not limited to, 
antisense suppression (Smith, et al (1988) Nature 334:724-726) , co-suppression (Napoli, et al 
(1989) Plant Cell 2:279-289), ribozymes (PCT Publication WO 97/10328), and combinations of 
sense and antisense Waterhouse, et al (1998) Proc. Natl Acad. Sci. USA 95:13959-13964. 



15 



Attorney Docket No: 
52015/01/US 

Methods for the suppression of endogenous sequences in a host cell typically employ the 
transcription or transcription and translation of at least a portion of the sequence to be 
suppressed. Such sequences may be homologous to coding as well as non-coding regions of the 
endogenous sequence. 

Regulatory transcript termination regions may be provided in plant expression constructs 
of this invention as well. Transcript termination regions may be provided by the DNA sequence 
encoding the prenyltransferase or tocopherol cyclase or a convenient transcription termination 
region derived from a different gene source, for example, the transcript termination region which 
is naturally associated with the transcript initiation region. The skilled artisan will recognize that 
any convenient transcript termination region which is capable of terminating transcription in a 
plant cell may be employed in the constructs of the present invention. 

Alternatively, constructs may be prepared to direct the expression of the prenyltransferase 
or tocopherol cyclase sequences directly from the host plant cell plastid. Such constructs and 
methods are known in the art and are generally described, for example, in Svab, et al (1990) 
Proa Natl Acad. Set USA 87:8526-8530 and Svab and Maliga (1993) Proc. Natl Acad. Set 
USA 90:913-917 and in U.S. Patent Number 5,693,507. 

The prenyltransferase or tocopherol cyclase constructs of the present invention can be 
used in transformation methods with additional constructs providing for the expression of other 
nucleic acid sequences encoding proteins involved in the production of tocopherols, or 
tocopherol precursors such as homogentisic acid and/or phytylpyrophosphate. Nucleic acid 
sequences encoding proteins involved in the production of homogentisic acid are known in the 
art, and include but not are limited to, 4-hydroxyphenylpyruvate dioxygenase (HPPD, EC 
1.13.11.27) described for example, by Garcia, et al ((1999) Plant Physiol 11 9(4): 1507-1 5 16), 
mono or Afunctional tyrA (described for example by Xia, et al (1992) J. Gen Microbiol 
138:1309-1316, and Hudson, et al (1984) J. Mol Biol 180:1023-1051), Oxygenase, 4- 
hydroxyphenylpyruvate di- (9CI), 4-Hydroxyphenylpyruvate dioxygenase; p- 
Hydroxyphenylpyruvate dioxygenase; p-Hydroxyphenylpyruvate hydroxylase; p- 
Hydroxyphenylpyruvate oxidase; p-Hydroxyphenylpyruvic acid hydroxylase; p- 
Hydroxyphenylpyruvic hydroxylase; p-Hydroxyphenylpyruvic oxidase), 4-hydroxyphenylacetate, 
NAD(P)H:oxygen oxidoreductase (1-hydroxylating); 4-hydroxyphenylacetate 1 -monooxygenase, 
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and the like. In addition, constructs for the expression of nucleic acid sequences encoding 
proteins involved in the production of phytylpyrophosphate can also be employed with the 
prenyltransferase or tocopherol cyclase constructs of the present invention. Nucleic acid 
sequences encoding proteins involved in the production of phytylpyrophosphate are known in the 
5 art, and include, but are not limited to geranylgeranylpyrophosphate synthase (GGPPS), 
geranylgeranylpyrophosphate reductase (GGH), l-deoxyxylulose-5 -phosphate synthase, 1- 
deoxy-D-xylolose-5-phosphate reductoisomerase, 4-diphosphocytidyl-2-C-methylerythritol 
synthase, isopentyl pyrophosphate isomerase. 

The prenyltransferase or tocopherol cyclase sequences of the present invention find use in 
10 the preparation of transformation constructs having a second expression cassette for the 
expression of additional sequences involved in tocopherol biosynthesis. Additional tocopherol 
; y biosynthesis sequences of interest in the present invention include, but are not limited to gamma- 
ill tocpherol methyltransferase (Shintani, et al (1998) Science 282(5396):2098-2100), tocopherol 
cyclase, and tocopherol methyltransferase. 

t . kW 

5 A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 

containing the expression constructs have been introduced is considered transformed, transfected, 

* *» 

* K 

. , .i-mw, 

> ■ 

or transgenic. A transgenic or transformed cell or plant also includes progeny of the cell or plant 
[: Z and progeny produced from a breeding program employing such a transgenic plant as a parent in 

* 

□ a cross and exhibiting an altered phenotype resulting from the presence of a prenyltransferase or 

"2 0 tocopherol cyclase nucleic acid sequence. 

Plant expression or transcription constructs having a prenyltransferase or tocopherol 
cyclase as the DNA sequence of interest for increased or decreased expression thereof may be 
employed with a wide variety of plant life, particularly, plant life involved in the production of 
vegetable oils for edible and industrial uses. Particularly preferred plants for use in the methods 

2 5 of the present invention include, but are not limited to: Acacia, alfalfa, aneth, apple, apricot, 

artichoke, arugula, asparagus, avocado, banana, barley, beans, beet, blackberry, blueberry, 
broccoli, brussels sprouts, cabbage, canola, cantaloupe, carrot, cassava, cauliflower, celery, 
cherry, chicory, cilantro, citrus, Clementines, coffee, corn, cotton, cucumber, Douglas fir, 
eggplant, endive, escarole, eucalyptus, fennel, figs, garlic, gourd, grape, grapefruit, honey dew, 

3 0 jicama, kiwifruit, lettuce, leeks, lemon, lime, Loblolly pine, mango, melon, mushroom, nectarine, 
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nut, oat, oil palm, oil seed rape, okra, onion, orange, an ornamental plant, papaya, parsley, pea, 
peach, peanut, pear, pepper, persimmon, pine, pineapple, plantain, plum, pomegranate, poplar, 
potato, pumpkin, quince, radiata pine, radicchio, radish, raspberry, rice, rye, sorghum, Southern 
pine, soybean, spinach, squash, strawberry, sugarbeet, sugarcane, sunflower, sweet potato, 
sweetgum, tangerine, tea, tobacco, tomato, triticale, turf, turnip, a vine, watermelon, wheat, yams, 
and zucchini. 

Most especially preferred are temperate oilseed crops. Temperate oilseed crops of 
interest include, but are not limited to, rapeseed (Canola and High Erucic Acid varieties), 
sunflower, safflower, cotton, soybean, peanut, coconut and oil palms, and corn. Depending on 
the method for introducing the recombinant constructs into the host cell, other DNA sequences 
may be required. Importantly, this invention is applicable to dicotyledyons and monocotyledons 
species alike and will be readily applicable to new and/or improved transformation and 
regulation techniques. 

Of particular interest, is the use of prenyltransferase or tocopherol cyclase constructs in 
plants to produce plants or plant parts, including, but not limited to leaves, stems, roots, 
reproductive, and seed, with a modified content of tocopherols in plant parts having transformed 
plant cells. 

For immunological screening, antibodies to the protein can be prepared by injecting 
rabbits or mice with the purified protein or portion thereof, such methods of preparing antibodies 
being well known to those in the art. Either monoclonal or polyclonal antibodies can be 
produced, although typically polyclonal antibodies are more useful for gene isolation. Western 
analysis may be conducted to determine that a related protein is present in a crude extract of the 
desired plant species, as determined by cross-reaction with the antibodies to the encoded 
proteins. When cross-reactivity is observed, genes encoding the related proteins are isolated by 
screening expression libraries representing the desired plant species. Expression libraries can be 
constructed in a variety of commercially available vectors, including lambda gtl 1, as described in 
Sambrook, et al. {Molecular Cloning: A Laboratory Manual^ Second Edition (1989) Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York). 

To confirm the activity and specificity of the proteins encoded by the identified nucleic 
acid sequences as prenyltransferase or tocopherol cyclase enzymes, in vitro assays are performed 
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in insect cell cultures using baculovirus expression systems. Such baculovirus expression 
systems are known in the art and are described by Lee, et al U.S. Patent Number 5,348,886, the 
entirety of which is herein incorporated by reference. 

In addition, other expression constructs may be prepared to assay for protein activity 
utilizing different expression systems. Such expression constructs are transformed into yeast or 
prokaryotic host and assayed for prenyltransferase or tocopherol cyclase activity. Such 
expression systems are known in the art and are readily available through commercial sources. 

In addition to the sequences described in the present invention, DNA coding sequences 
useful in the present invention can be derived from algae, fungi, bacteria, mammalian sources, 
plants, etc. Homology searches in existing databases using signature sequences corresponding 
to conserved nucleotide and amino acid sequences of prenyltransferase or tocopherol cyclase can 
be employed to isolate equivalent, related genes from other sources such as plants and 
microorganisms. Searches in EST databases can also be employed. Furthermore, the use of 
DNA sequences encoding enzymes functionally enzymatically equivalent to those disclosed 
herein, wherein such DNA sequences are degenerate equivalents of the nucleic acid sequences 
disclosed herein in accordance with the degeneracy of the genetic code, is also encompassed by 
the present invention. Demonstration of the functionality of coding sequences identified by any 
of these methods can be carried out by complementation of mutants of appropriate organisms, 
such as Synechocystis, Shewanella, yeast, Pseudomonas, Rhodobacteria, etc., that lack specific 
biochemical reactions, or that have been mutated. The sequences of the DNA coding regions 
can be optimized by gene resynthesis, based on codon usage, for maximum expression in 
particular hosts. 

For the alteration of tocopherol production in a host cell, a second expression construct 
can be used in accordance with the present invention. For example, the prenyltransferase or 
tocopherol cyclase expression construct can be introduced into a host cell in conjunction with a 
second expression construct having a nucleotide sequence for a protein involved in tocopherol 
biosynthesis. 

The method of transformation in obtaining such transgenic plants is not critical to the 
instant invention, and various methods of plant transformation are currently available. 
Furthermore, as newer methods become available to transform crops, they may also be directly 
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applied hereunder. For example, many plant species naturally susceptible to Agrobacterium 
infection may be successfully transformed via tripartite or binary vector methods of 
Agrobacterium mediated transformation. In many instances, it will be desirable to have the 
construct bordered on one or both sides by T-DNA, particularly having the left and right borders, 
5 more particularly the right border. This is particularly useful when the construct uses A. 

tumefaciens or A. rhizogenes as a mode for transformation, although the T-DNA borders may 
find use with other modes of transformation. In addition, techniques of microinjection, DNA 
particle bombardment, and electroporation have been developed which allow for the 
transformation of various monocot and dicot plant species. 
1 0 Normally, included with the DNA construct will be a structural gene having the necessary 

^ regulatory regions for expression in a host and providing for selection of transformant cells. The 
^0 gene may provide for resistance to a cytotoxic agent, e.g. antibiotic, heavy metal, toxin, etc., 
m complementation providing prototrophy to an auxotrophic host, viral immunity or the like. 

t f n 

^ Depending upon the number of different host species the expression construct or components 
j Jf 5 thereof are introduced, one or more markers may be employed, where different conditions for 

1 , Pi 

i; selection are used for the different hosts. 

\'Z s Where Agrobacterium is used for plant cell transformation, a vector may be used which 

1 f may be introduced into the Agrobacterium host for homologous recombination with T-DNA or 

if** 

q the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid containing the T- 
20 DNA for recombination may be armed (capable of causing gall formation) or disarmed 

(incapable of causing gall formation), the latter being permissible, so long as the vir genes are 
present in the transformed Agrobacterium host. The armed plasmid can give a mixture of normal 
plant cells and gall. 

In some instances where Agrobacterium is used as the vehicle for transforming host plant 

2 5 cells, the expression or transcription construct bordered by the T-DNA border region(s) will be 

inserted into a broad host range vector capable of replication in E. coli and Agrobacterium, there 
being broad host range vectors described in the literature. Commonly used is pRK2 or 
derivatives thereof. See, for example, Ditta, et aL, (Proc. Nat Acad Sci, U.S.A. (1980) 
77:7347-7351) and EPA 0 120 515, which are incorporated herein by reference. Alternatively, 

3 0 one may insert the sequences to be expressed in plant cells into a vector containing separate 
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replication sequences, one of which stabilizes the vector in E. coli, and the other in 
Agrobacterium. See, for example, McBride, et al {Plant Mol Biol (1990) 74:269-276), wherein 
the pRiHRI (Jouanin, et al, Mol Gen. Genet. (1985) 201 :370-374) origin of replication is 
utilized and provides for added stability of the plant expression vectors in host Agrobacterium 
cells. 

Included with the expression construct and the T-DNA will be one or more markers, 
which allow for selection of transformed Agrobacterium and transformed plant cells. A 
number of markers have been developed for use with plant cells, such as resistance to 
chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin, or the like. The particular 
marker employed is not essential to this invention, one or another marker being preferred 
depending on the particular host and the manner of construction. 

For transformation of plant cells using Agrobacterium, explants may be combined and 
incubated with the transformed Agrobacterium for sufficient time for transformation, the bacteria 
killed, and the plant cells cultured in an appropriate selective medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate plant hormones in accordance with 
known methods and the shoots transferred to rooting medium for regeneration of plants. The 
plants may then be grown to seed and the seed used to establish repetitive generations and for 
isolation of vegetable oils. 

There are several possible ways to obtain the plant cells of this invention which contain 
multiple expression constructs. Any means for producing a plant comprising a construct having 
a DNA sequence encoding the expression construct of the present invention, and at least one 
other construct having another DNA sequence encoding an enzyme are encompassed by the 
present invention. For example, the expression construct can be used to transform a plant at the 
same time as the second construct either by inclusion of both expression constructs in a single 
transformation vector or by using separate vectors, each of which express desired genes. The 
second construct can be introduced into a plant which has already been transformed with the 
prenyltransferase or tocopherol cyclase expression construct, or alternatively, transformed plants, 
one expressing the prenyltransferase or tocopherol cyclase construct and one expressing the 
second construct, can be crossed to bring the constructs together in the same plant. 
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Transgenic plants of the present invention may be produced from tissue culture, and 
subsequent generations grown from seed. Alternatively, transgenic plants may be grown using 
apomixis. Apomixis is a genetically controlled method of reproduction in plants where the 
embryo is formed without union of an egg and a sperm. There are three basic types of apomictic 
reproduction: 1) apospory where the embryo develops from a chromosomally unreduced egg in 
an embryo sac derived from the nucleus, 2) diplospory where the embryo develops from an 
unreduced egg in an embryo sac derived from the megaspore mother cell, and 3) adventitious 
embryony where the embryo develops directly from a somatic cell In most forms of apomixis, 
pseudogamy or fertilization of the polar nuclei to produce endosperm is necessary for seed 
viability. In apospory, a nurse cultivar can be used as a pollen source for endosperm formation in 
seeds. The nurse cultivar does not affect the genetics of the aposporous apomictic cultivar since 
the unreduced egg of the cultivar develops parthenogenetically, but makes possible endosperm 
production. Apomixis is economically important, especially in transgenic plants, because it 
causes any genotype, no matter how heterozygous, to breed true. Thus,with apomictic 
reproduction, heterozygous transgenic plants can maintain their genetic fidelity throughout 
repeated life cycles. Methods for the production of apomictic plants are known in the art. See, 
U.S. Patent No. 5,81 1,636, which is herein incorporated by reference in its entirety. 

The nucleic acid sequences of the present invention can be used in constructs to provide 
for the expression of the sequence in a variety of host cells, both prokaryotic eukaryotic. Host 
cells of the present invention preferably include monocotyledenous and dicotyledenous plant 
cells. 

In general, the skilled artisan is familiar with the standard resource materials which 
describe specific conditions and procedures for the construction, manipulation and isolation of 
macromolecules (e.g., DNA molecules, plasmids, etc.), generation of recombinant organisms and 
the screening and isolating of clones, (see for example, Sambrook et al, Molecular Cloning; A 
Laboratory Manual, Cold Spring Harbor Press (1989); Maliga et al, Methods in Plant 
Molecular Biology, Cold Spring Harbor Press (1995), the entirety of which is herein incorporated 
by reference; Birren et al, Genome Analysis: Analyzing DNA, 1, Cold Spring Harbor, New York, 
the entirety of which is herein incorporated by reference). 
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Methods for the expression of sequences in insect host cells axe known in the art. 
Baculovirus expression vectors are recombinant insect viruses in which the coding sequence for a 

chosen foreign gene has been inserted behind a baculovirus promoter in place of the viral gene, 
e.g., polyhedrin (Smith and Summers, U.S. Pat. No., 4,745,051, the entirety of which is 
5 incorporated herein by reference). Baculovirus expression vectors are known in the art, and are 
described for example inDoerfler, Curr. Top. Microbiol Immunol 737:51-68 (1968); Luckow 
and Summers, Bio/Technology 6:47-55 (1988a); Miller, Annual Review of Microbiol 42:177-199 
(1988); Summers, Curr. Comm. Molecular Biology, Cold Spring Harbor Press, Cold Spring 
Harbor, N.Y. (1988); Summers and Smith, A Manual of Methods for Baculovirus Vectors and 
1 0 Insect Cell Culture Procedures, Texas Ag. Exper. Station Bulletin No. 1555 (1988), the 
entireties of which is herein incorporated by reference) 

4< till F» 

y Methods for the expression of a nucleic acid sequence of interest in a fungal host cell are 

H 

* V- H 

i";o known in the art. The fungal host cell may, for example, be a yeast cell or a filamentous fungal 

♦ tf " 

cell. Methods for the expression of DNA sequences of interest in yeast cells are generally 

- pWt 

. JTF-»F 

^ 5 described in "Guide to yeast genetics and molecular biology", Guthrie and Fink, eds. Methods in 

* « 

enzymology , Academic Press, Inc. Vol 194 (1991) and Gene expression technology", Goeddel 
;^ ed, Methods in Enzymology, Academic Press, Inc., Vol 185 (1991). 

^ Mammalian cell lines available as hosts for expression are known in the art and include 

F * niKtt 
« + Hp*«M 

Q many immortalized cell lines available from the American Type Culture Collection (ATCC, 

P M^fTf 

:: 2 0 Manassas, VA), such as HeLa cells, Chinese hamster ovary (CHO) cells, baby hamster kidney 
(BHK) cells and a number of other cell lines. Suitable promoters for mammalian cells are also 
known in the art and include, but are not limited to, viral promoters such as that from Simian 
Virus 40 (SV40) (Fiers et al, Nature 273:1 13 (1978), the entirety of which is herein incorporated 
by reference), Rous sarcoma virus (RSV), adenovirus (ADV) and bovine papilloma virus (BPV). 

2 5 Mammalian cells may also require terminator sequences and poly-A addition sequences. 

Enhancer sequences which increase expression may also be included and sequences which 
promote amplification of the gene may also be desirable (for example methotrexate resistance 
genes). 

Vectors suitable for replication in mammalian cells are well known in the art, and may 

3 0 include viral replicons, or sequences which insure integration of the appropriate sequences 
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encoding epitopes into the host genome. Plasmid vectors that greatly facilitate the construction of 
recombinant viruses have been described (see, for example, Mackett et al 9 J Virol 49:857 
(1984); Chakrabarti etaL, Mol Cell Biol 5:3403 (1985); Moss, In: Gene Transfer Vectors For 
Mammalian Cells (Miller and Calos, eds., Cold Spring Harbor Laboratory, N.Y., p. 10, (1987); 
all of which are herein incorporated by reference in their entirety). 

The invention also includes plants and plant parts, such as seed, oil and meal derived 
from seed, and feed and food products processed from plants, which are enriched in tocopherols. 
Of particular interest is seed oil obtained from transgenic plants where the tocopherol level has 
been increased as compared to seed oil of a non-transgenic plant. 

The harvested plant material may be subjected to additional processing to further enrich 
the tocopherol content. The skilled artisan will recognize that there are many such processes or 
methods for refining, bleaching and degumming oil. United States Patent Number 5,932,261, 
issued August 3, 1999, discloses on such process, for the production of a natural carotene rich 
refined and deodorised oil by subjecting the oil to a pressure of less than 0.060 mbar and to a 
temperature of less than 200. degree. C. Oil distilled by this process has reduced free fatty acids, 
yielding a refined, deodorised oil where Vitamin E contained in the feed oil is substantially 
retained in the processed oil. The teachings of this patent are incorporated herein by reference. 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included for purposes of illustration only and are 
not intended to limit the present invention. 

EXAMPLES 

Example 1: Identification of Prenyltransferase or tocopherol cyclase Sequences 

PSI-BLAST (Altschul, et al (1997) Nuc Acid Res 25:3389-3402) profiles were generated 
for both the straight chain and aromatic classes of prenyltransferases. To generate the straight 
chain profile, a prenyl- transferase from Porphyra purpurea (Genbank accession 1709766) was 
used as a query against the NCBI non-redundant protein database. The E. coli enzyme involved 
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in the formation of ubiquinone, ubiA (genbank accession 1790473) was used as a starting 
sequence to generate the aromatic prenyltransferase profile. These profiles were used to search 
public and proprietary DNA and protein data bases. In Arabidopsis six putative 
prenyltransferases of the straight-chain class were identified, ATPT1, (SEQ ID NO:9), ATPT7 
5 (SEQ ID NO: 10), ATPT8 (SEQ ID NO: 1 1), ATPT9 (SEQ ID NO: 13), ATPT10 (SEQ ID 
NO: 14), and ATPT1 1 (SEQ ID NO: 15), and six were identified of the aromatic class, ATPT2 
(SEQ ID NO:l), ATPT3 (SEQ ID NO:3), ATPT4 (SEQ ID NO:5), ATPT5 (SEQ ID NO:7), 
ATPT6 (SEQ ID NO:8), and ATPT12 (SEQ ID NO: 16). Additional prenyltransferase sequences 
from other plants related to the aromatic class of prenyltransferases, such as soy (SEQ ID NOs: 
1 0 1 9-23, the deduced amino acid sequence of SEQ ID NO:23 is provided in SEQ ID NO:24) and 
maize (SEQ ID NOs:25-29, and 3 1) are also identified. The deduced amino acid sequence of 
; 0 ZMPT5 (SEQ ID NO:29) is provided in SEQ ID NO:30. 

■ 

* ml 

m Searches are performed on a Silicon Graphics Unix computer using additional 

* u H 

j;» 5 Bioaccellerator hardware and GenWeb software supplied by Compugen Ltd. This software and 
; § 5 hardware enables the use of the Smith- Waterman algorithm in searching DNA and protein 

. V It 

databases using profiles as queries. The program used to query protein databases is profilesearch. 

This is a search where the query is not a single sequence but a profile based on a multiple 
^ alignment of amino acid or nucleic acid sequences. The profile is used to query a sequence data 
□ set, i.e., a sequence database. The profile contains all the pertinent information for scoring each 
2 0 position in a sequence, in effect replacing the "scoring matrix" used for the standard query 

searches. The program used to query nucleotide databases with a protein profile is tprofilesearch. 

Tprofilesearch searches nucleic acid databases using an amino acid profile query. As the search is 

running, sequences in the database are translated to amino acid sequences in six reading frames. 

The output file for tprofilesearch is identical to the output file for profilesearch except for an 
2 5 additional column that indicates the frame in which the best alignment occurred. 

The Smith- Waterman algorithm, (Smith and Waterman (1981) supra), is used to search 

for similarities between one sequence from the query and a group of sequences contained in the 

database. E score values as well as other sequence information, such as conserved peptide 

sequences are used to identify related sequences. 
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To obtain the entire coding region corresponding to the Arabidopsis prenyltransferase 
sequences, synthetic oligo-nucleotide primers are designed to amplify the 5 ? and 3 ? ends of 
partial cDNA clones containing prenyltransferase sequences. Primers are designed according to 
the respective Arabidopsis prenyltransferase sequences and used in Rapid Amplification of 
5 cDNA Ends (RACE) reactions (Frohman et al (1988) Proa Natl Acad Sci USA 85:8998-9002) 
using the Marathon cDNA amplification kit (Clontech Laboratories Inc ? Palo Alto, CA). 

Amino acid sequence alignments between ATPT2 (SEQ ID NO:2) ? ATPT3 (SEQ ID 
NO:4), ATPT4 (SEQ ID NO:6) ? ATPT8 (SEQ ID NO: 12), and ATPT12 (SEQ ID NO:17) are 
performed using ClustalW (Figure 1), and the percent identity and similarities are provided in 
10 Table 1 below. 

0 Table 1: 
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Example 2: Preparation of Prenyl Transferase Expression Constructs 
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A plasmid containing the napin cassette derived from pCGN3223 (described in USPN 
5,639,790, the entirety of which is incorporated herein by reference) was modified to make it 
more useful for cloning large DNA fragments containing multiple restriction sites, and to allow 
the cloning of multiple napin fusion genes into plant binary transformation vectors. An adapter 
comprised of the self annealed oligonucleotide of sequence 

CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTAAAT 
(SEQ ID NO:40) was ligated into the cloning vector pBC SK+ (Stratagene) after digestion with 
the restriction endonuclease BssHII to construct vector pCGN7765. Plamids pCGN3223 and 
pCGN7765 were digested with NotI and ligated together. The resultant vector, pCGN7770, 
contains the pCGN7765 backbone with the napin seed specific expression cassette from 
pCGN3223. 

The cloning cassette, pCGN7787, essentially the same regulatory elements as 
pCGN7770, with the exception of the napin regulatory regions of pCGN7770 have been replaced 
with the double CAMV 35S promoter and the tml polyadenylation and transcriptional 
termination region. 

A binary vector for plant transformation, pCGN5139, was constructed from pCGN1558 
(McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). The polylinker of 
pCGN1558 was replaced as a HindIII/Asp718 fragment with a polylinker containing unique 
restriction endonuclease sites, AscI, Pad, Xbal, Swal, BamHI, and NotI. The Asp718 and 
Hindlll restriction endonuclease sites are retained in pCGN5139. 

A series of turbo binary vectors are constructed to allow for the rapid cloning of DNA 
sequences into binary vectors containing transcriptional initiation regions (promoters) and 
transcriptional termination regions. 

The plasmid pCGN8618 was constructed by ligating oligonucleotides 5 5 - 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:41) and 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:42) into Sall/Xhol- 
digested pCGN7770. A fragment containing the napin promoter, polylinker and napin 3 5 region 
was excised from pCGN8618 by digestion with Asp718I; the fragment was blunt-ended by filling 
in the 5' overhangs with Klenow fragment then ligated into pCGN5139 that had been digested 
with Asp718I and Hindlll and blunt-ended by filling in the 5' overhangs with Klenow fragment. 
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A plasmid containing the insert oriented so that the napin promoter was closest to the blunted 
Asp718I site of pCGN5139 and the napin 3' was closest to the blunted Hindlll site was subjected 
to sequence analysis to confirm both the insert orientation and the integrity of cloning junctions. 

The resulting plasmid was designated pCGN8622. 
5 The plasmid pCGN8619 was constructed by ligating oligonucleotides 5'- 

TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC -3' (SEQ ID NO:43) and 5'- 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3 5 (SEQ ID NO:44) into Sall/Xhol- 
digested pCGN7770. A fragment containing the napin promoter, polylinker and napin 3' region 
was removed from pCGN8619 by digestion with Asp718I; the fragment was blunt-ended by 
1 0 filling in the 5' overhangs with Klenow fragment then ligated into pCGN5 139 that had been 
digested with Asp718I and Hindlll and blunt-ended by filling in the 5' overhangs with Klenow 

I* P. bit 

[j£ fragment. A plasmid containing the insert oriented so that the napin promoter was closest to the 

■ w >i 

1 - — ■ J M 

i:o blunted Asp718I site of pCGN5139 and the napin 3' was closest to the blunted Hindlll site was 
subjected to sequence analysis to confirm both the insert orientation and the integrity of cloning 
; | 5 junctions. The resulting plasmid was designated pCGN8623. 

» ■* -> 
- - i ■ * » 

The plasmid pCGN8620 was constructed by ligating oligonucleotides 5'- 

□ TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3' (SEQ ID NO:45) and 5'- 
l -"t CCTGCAGGAAGCTTGCGGCCGCGGATCC-3 ' (SEQ ID NO:46) into Sall/SacI-digested 

■■ - >IW^ 
r > p*W 

□ pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region was 

2 0 removed from pCGN8620 by complete digestion with Asp71 81 and partial digestion with Notl. 

The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment then ligated 
into pCGN5 1 39 that had been digested with Asp71 81 and Hindlll and blunt-ended by filling in 
the 5' overhangs with Klenow fragment. A plasmid containing the insert oriented so that the 
d35S promoter was closest to the blunted Asp718I site of pCGN5139 and the tml 3' was closest 
25 to the blunted Hindlll site was subjected to sequence analysis to confirm both the insert 
orientation and the integrity of cloning junctions. The resulting plasmid was designated 
pCGN8624. 

The plasmid pCGN8621 was constructed by ligating oligonucleotides 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT -3 ? (SEQ ID NO:47) and 5'- 

3 0 GGATCCGCGGCCGCAAGCTTCCTGCAGG-3 ' (SEQ ID NO:48) into Sall/SacI-digested 
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pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region was 
removed from pCGN8621 by complete digestion with Asp718I and partial digestion with NotL 
The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment then ligated 
into pCGN5139 that had been digested with Asp718I and Hindlll and blunt-ended by filling in 
5 the 5' overhangs with Klenow fragment. A plasmid containing the insert oriented so that the 
d35S promoter was closest to the blunted Asp718I site of pCGN5139 and the tml 3' was closest 
to the blunted Hindlll site was subjected to sequence analysis to confirm both the insert 
orientation and the integrity of cloning junctions. The resulting plasmid was designated 
pCGN8625. 

1 0 The plasmid construct pCGN8640 is a modification of pCGN8624 described above. A 

^ 938bp PstI fragment isolated from transposon Tn7 which encodes bacterial spectinomycin and 
v| streptomycin resistance (Fling et al. (1985), Nucleic Acids Research 13(19):7095-7106), a 
;Yj determinant for E. coli and Agrobacterium selection, was blunt ended with Pfu polymerase. The 
jff blunt ended fragment was ligated into pCGN8624 that had been digested with Spel and blunt 
jp 5 ended with Pfu polymerase. The region containing the PstI fragment was sequenced to confirm 
" both the insert orientation and the integrity of cloning junctions. 

!;;! The spectinomycin resistance marker was introduced into pCGN8622 and pCGN8623 as 

!^ follows. A 7.7 Kbp Avrll-SnaBI fragment from pCGN8640 was ligated to a 1 0.9 Kbp Avrll- 
3 SnaBI fragment from pCGN8623 or pCGN8622, described above. The resulting plasmids were 

1 i 0 pCGN864 1 and pCGN8643 , respectively. 

The plasmid pCGN8644 was constructed by ligating oligonucleotides 5'- 
GATCACCTGCAGGAAGCTTGCGGCCGCGGATCCAATGCA-3' (SEQ ID NO:49) and 5'- 
TTGGATCCGCGGCCGCAAGCTTCCTGCAGGT-3' (SEQ ID NO:50) into BamHI-PstI 
digested pCGN8640. 

2 5 Synthetic oligonulceotides were designed for use in Polymerase Chain Reactions (PCR) to 

amplify the coding sequences of ATPT2, ATPT3, ATPT4, ATPT8, and ATPT12 for the preparation 
of expression constructs and are provided in Table 2 below. 



Table 2: 



Name 


Restriction Site 


Sequence 


SEQ ID NO: 


ATPT2 


5' NotI 


GGATCCGCGGCCGCACAATGGAGTC 


51 






TCTGCTCTCTAGTTCT 
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ATPT2 


3' Ssel 


GGATCCTGCAGGTCACTTCAAAAAA 
GGTAACAGCAAGT 


52 


ATPT3 


5' NotI 


GGATCCGCGGCCGCACAATGGCGTT 
TTTTGGGCTCTCCCGTGTTT 


53 


ATPT3 


3' Ssel 


GGATCCTGCAGGTTATTGAAAACTT 
CTTCCAAGTACAACT 


54 


ATPT4 


5' NotI 


GGATCCGCGGCCGCACAATGTGGCG 
AAGATCTGTTGTT 


55 


ATPT4 


3' Ssel 


GGATCCTGCAGGTCATGGAGAGTAG 
AAGGAAGGAGCT 


56 


ATPT8 


5' NotI 


GGATCCGCGGCCGCACAATGGTACT 
TGCCGAGGTTCCAAAGCTTGCCTCT 


57 


ATPT8 


3' Ssel 


GGATCCTGCAGGTCACTTGTTTCTG 
GTGATGACTCTAT 


58 


ATPT12 


5' NotI 


GGATCCGCGGCCGCACAATGACTTC 
GATTCTCAACACT 


59 


ATPT12 


3' Ssel 


GGATCCTGCAGGTCAGTGTTGCGAT 
GCTAATGCCGT 


60 



The coding sequences of ATPT2, ATPT3, ATPT4, ATPT8, and ATPT12 were all amplified 
using the respective PCR primers shown in Table 2 above and cloned into the TopoTA vector 
(Invitrogen). Constructs containing the respective prenyltransferase sequences were digested with 
NotI and Sse8387I and cloned into the turbobinary vectors described above. 

The sequence encoding ATPT2 prenyltransferase was cloned in the sense orientation into 
pCGN8640 to produce the plant transformation construct pCGN10800 (Figure 2). The ATPT2 
sequence is under control of the 35S promoter. 

The ATPT2 sequence was also cloned in the anti sense orientation into the construct 
pCGN8641 to create pCGN 10801 (Figure 3). This construct provides for the antisense expression of 
the ATPT2 sequence from the napin promoter. 

The ATPT2 coding sequence was also cloned in the sense orientation into the vector 
pCGN8643 to create the plant transformation construct pCGN 10822 

The ATPT2 coding sequence was also cloned in the antisense orientation into the vector 
pCGN8644 to create the plant transformation construct pCGN 10803 (Figure 4). 

The ATPT4 coding sequence was cloned into the vector pCGN864 to create the plant 
transformation construct pCGN 10806 (Figure 5). The ATPT2 coding sequence was cloned into the 
vector TopoTA ™ vector from Invitrogen, to create the plant transformation construct 
pCGN10807(Figure 6). The ATPT3 coding sequence was cloned into the TopoTA vector to create 
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the plant transformation construct pCGN10808 (Figure 7). The ATPT3 coding sequence was cloned 
in the sense orientation into the vector pCGN8640 to create the plant transformation construct 
pCGN 10809 (Figure 8). The ATPT3 coding sequence was cloned in the antisense orientation into the 
vector pCGN8641 to create the plant transformation construct pCGN10810 (Figure 9). The ATPT3 
coding sequence was cloned into the vector pCGN8643 to create the plant transformation construct 
pCGN1081 1 (Figure 10). The ATPT3 coding sequence was cloned into the vector pCGN8644 to 
create the plant transformation construct pCGN10812 (Figure 1 1). The ATPT4 coding sequence was 
cloned into the vector pCGN8640 to create the plant transformation construct pCGN10813 (Figure 
12). The ATPT4 coding sequence was cloned into the vector pCGN8641 to create the plant 
transformation construct pCGN10814 (Figure 13). The ATPT4 coding sequence was cloned into the 
vector pCGN8643 to create the plant transformation construct pCGN10815 (Figure 14). The ATPT4 
coding sequence was cloned in the antisense orientation into the vector pCGN8644 to create the 
plant transformation construct pCGN10816 (Figure 15). The ATPT8 coding sequence was cloned in 
the sense orientation into the vector pCGN8643 to create the plant transformation construct 
pCGN10819 (Figure 17). The ATPT12 coding sequence was cloned into the vector pCGN8640 to 
create the plant transformation construct pCGN 10824 (Figure 18). The ATPT12 coding sequence 
was cloned into the vector pCGN8643 to create the plant transformation construct pCGN10825 
(Figure 19). The ATPT8 coding sequence was cloned into the vector pCGN8640 to create the plant 
transformation construct pCGN10826 (Figure 20). 

Example 3: Plant Transformation with Prenyl Transferase Constructs 

Transgenic Brassica plants are obtained by Agrobacterium-mQdmted transformation as 
described by Radke et al (Theor. Appl Genet (1988) 75:685-694; Plant Cell Reports (1992) 
77:499-505). Transgenic Arabidopsis thaliana plants may be obtained by Agrobacterium- 
mediated transformation as described by Valverkens et al, (Proc. Nat Acad. Sci (1988) 
55:5536-5540), or as described by Bent et al. ((1994), Science 265:1856-1860), or Bechtold et al. 
((1993), C&AcadSci, Life Sciences 316:1 194-1 199). Other plant species may be similarly 
transformed using related techniques. 
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Alternatively, microprojectile bombardment methods, such as described by Klein et aL 
{Bio/Technology 70:286-291) may also be used to obtain nuclear transformed plants. 

Example 4: Identification of Additional Prenyltransferases 
5 Additional BLAST searches were performed using the ATPT2 sequence, a sequence in 

the class of aromatic prenyltransferases. ESTs, and in some case, full-length coding regions, 
were identified in proprietary DNA libraries. 

Soy full-length homologs to ATPT2 were identified by a combination of BLAST (using 
ATPT2 protein sequence) and 5' RACE. Two homologs resulted (SEQ ID NO:95 and SEQ ID 
1 0 NO:96). Translated amino acid sequences are provided by SEQ ID NO:97 and SEQ ID NO:98. 

A rice est ATPT2 homo log is shown in SEQ ID NO: 99 (obtained from BLAST using the 
^ij wheat ATPT2 homolog). 

Ku Other homolog sequences were obtained using ATPT2 and PSI-BLAST, including est 

■ rl 

a sequences from wheat (SEQ ID NO: 100), leek (SEQ ID NOs:101 and 102), canola (SEQ ID 
; P NO: 103), corn (SEQ ID NOs:104, 105 and 106), cotton (SEQ ID NO: 107) and tomato (SEQ ID 
NO: 108). 

i;3 A PSI-Blast profile generated using the E. coli ubiA (genbank accession 1790473) 

1 sequence was used to analyze the Synechocystis genome. This analysis identified 5 open reading 
□ frames (ORFs) in the Synechocystis genome that were potentially prenyltransferases; slr0926 

2 0 (annotated as ubiA (4-hydroxybenzoate-octaprenyltransferase, SEQ ID NO:32), sill 899 

(annotated as ctaB (cytocrome c oxidase folding protein, SEQ ID NO:33), slr0056 (annotated as 
g4 (chlorophyll synthase 33 kd subunit, SEQ ID NO:34), slrl518 (annotated as menA 
(menaquinone biosynthesis protein, SEQ ID NO:35), and slrl736 (annotated as a hypothetical 
protein of unknown function (SEQ ID NO: 3 6). 

25 

4A. Synechocystis Knock-outs 

To determine the functionality of these ORFs and their involvement, if any, in the 
biosynthesis of tocopherols, knockouts constructs were made to disrupt the ORF identified in 
Synechocystis. 
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Synthetic oligos were designed to amplify regions from the 5 ' (5 ' - 
TAATGTGTACATTGTCGGCCTC (17365') (SEQ ID NO:61) and 5'- 

GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCCACAATTCCCCGCACC 
GTC (1736kanprl)) (SEQ ID NO:62) and 3' (5 ' - AGGCTAATAAGC AC AAATGGGA (17363') 
5 (SEQ ID NO:63) and 5'-GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGC 
GGAATTGGTTTAGGTTATCCC (1736kanpr2)) (SEQ ID NO:64) ends of the sir 1736 ORF. 
The 1736kanprl and 1736kanpr2 oligos contained 20 bp of homology to the slrl736 ORF with 
an additional 40 bp of sequence homology to the ends of the kanamycin resistance cassette. 
Separate PCR steps were completed with these oligos and the products were gel purified and 
1 0 combined with the kanamycin resistance gene from puc4K (Pharmacia) that had been digested 
^ with Hindi and gel purified away from the vector backbone. The combined fragments were 



allowed to assemble without oligos under the following conditions: 94°C for 1 min, 55°C for 1 

4-1- *l 

■0 min, 72°C for 1 min plus 5 seconds per cycle for 40 cycles using pfu polymerase in lOOul 

♦ wvwW* 

H 1 

:3 reaction volume (Zhao, H and Arnold (1997) Nucleic Acids Res. 25(6):1307-1308). One 

t- 

% 5 microliter or five microliters of this assembly reaction was then amplified using 5' and 3' oligos 
nested within the ends of the ORF fragment, so that the resulting product contained 100-200 bp 
of the 5' end of the Synechocystis gene to be knocked out, the kanamycin resistance cassette, and 
100-200 bp of the 3' end of the gene to be knocked out. This PCR product was then cloned into 
the vector pGemT easy (Promega) to create the construct pMON21681 and used for 
2 0 Synechocystis transformation. 

Primers were also synthesized for the preparation of Synechocystis knockout constructs 
for the other sequences using the same method as described above, with the following primers. 
The ubiA 5' sequence was amplified using the primers 5'- GGATCCATGGTT 
GCCCAAACCCCATC (SEQ ID NO:65) and 5'- GCAATGTAACATCAGAGA 

2 5 TTTTGAGACACAACG TGGCTTTGGGTAAGCAACAATGACCGGC (SEQ ID NO:66). 

The 3' region was amplified using the synthetic oligonucleotide primers 5'- 
GAATTCTCAAAGCCAGCCCAGTAAC (SEQ ID NO:67) and 5'-GGTATGAGTC 
AGCAACACCTTCTTCACGAGGCAGACCTCAGCGGGTGCGAAAAGGGTTTTCCC (SEQ 
ID NO:68). The amplification products were combined with the kanamycin resistance gene from 

3 0 puc4K (Pharmacia) that had been digested with Hindi and gel purified away from the vector 
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backbone. The annealed fragment was amplified using 5' and 3' oligos nested within the ends of 
the ORF fragment (5'- CCAGTGGTTTAGGCTGTGTGGTC (SEQ ID NO:69) and 5 5 - 
CTG AGTTGGATGT ATTGGATC (SEQ ID NO:70)), so that the resulting product contained 
100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the kanamycin resistance 
5 cassette, and 100-200 bp of the 3' end of the gene to be knocked out. This PCR product was then 
cloned into the vector pGemT easy (Promega) to create the construct pMON21682 and used for 
Synechocystis transformation. 

Primers were also synthesized for the preparation of Synechocystis knockout constructs 
for the other sequences using the same method as described above, with the following primers. 
10 The si 1 1 899 5 9 sequence was amplified using the primers 5 9 - GGATCC ATGGTTACTT 
^ CGAC AAAAATCC (SEQ ID NO : 7 1 ) and 5 ' - GC AATGT AAC ATC AGAG 
B ATTTTGAGACACAACGTGGCTTTGCTAGGCAACCGCTTAGTAC (SEQ ED NO:72). The 
;0 3' region was amplified using the synthetic oligonucleotide primers 5'- 

Sj GAATTCTTAACCCAACAGTAAAGTTCCC (SEQ ID NO:73) and 5'- GGTATGAGTCAGC 
1 5 AACACCTTCTTCACGAGGC AGACCTCAGCGCCGGCATTGTCTTTTAC ATG (SEQ ID 
NO:74). The amplification products were combined with the kanamycin resistance gene from 
^ puc4K (Pharmacia) that had been digested with Hindi and gel purified away from the vector 
~'Z backbone. The annealed fragment was amplified using 5 5 and 3' oligos nested within the ends of 
□ the ORF fragment (5 '- GGAACCCTTGCAGCCGCTTC (SEQ ID NO:75) 

■ mfii 

2 0 and 5'- GTATGCCCAACTGGTGCAGAGG (SEQ ID NO:76)), so that the resulting product 
contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the kanamycin 
resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out. This PCR 
product was then cloned into the vector pGemT easy (Promega) to create the construct 
pMON21679 and used for Synechocystis transformation. 

2 5 Primers were also synthesized for the preparation of Synechocystis knockout constructs 

for the other sequences using the same method as described above, with the following primers. 
The slr0056 5' sequence was amplified using the primers 5'- 
GGATCCATGTCTGACACACAAAATACCG (SEQ ID NO:77) and 5'- 
GC AATGT AACATCAGAGATTTTGAGACACAACGTGGCTTTCGCCAATACCAGCCACC 

3 0 AAC AG (SEQ ID NO: 7 8). The 3' region was amplified using the synthetic oligonucleotide 
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primers 5'- GAATTCTCAAAT CCCCGCATGGCCTAG (SEQ ID NO:79) and 5'- 
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGGCCTACGGCTTGGA 
CGTGTGGG (SEQ ID NO: 80). The amplification products were combined with the kanamycin 
resistance gene from puc4K (Pharmacia) that had been digested with Hindi and gel purified 
5 away from the vector backbone. The annealed fragment was amplified using 5' and 3' oligos 
nested within the ends of the ORF fragment (5'- CACTTGGATTCCCCTGATCTG (SEQ ID 
NO:81) and 5'- GCAATACCCGCTTGGAAAACG (SEQ ID NO:82)), so that the resulting 
product contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the 
kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out. This 

1 0 PCR product was then cloned into the vector pGemT easy (Promega) to create the construct 

«i pMON21 677 and used for Synechocystis transformation. 

;if Primers were also synthesized for the preparation of Synechocystis knockout constructs 

rv H 

;fl for the other sequences using the same method as described above, with the following primers. 

•a .( 

=^ The slrl518 5' sequence was amplified using the primers 5'- GGATCCATGACCGAAT 

#. rilH> 

5 5 CTTCGCCCCTAGC (SEQ ID NO:83) and 5 ' -GC AATGTA AC ATC AG AG ATTTTGA 

GACACAACGTGGC TTTCAATCCTAGGTAGCCGAGGCG (SEQ ID NO:84). The 3' region 
■■*% was amplified using the synthetic oligonucleotide primers 5'- GAATTCTTAGCCCAGGCC 

1 AGCCCAGCC (SEQ ID NO:85)and 5'- GGTATGAGTCAGCAACACCTTCTTCACGA 
□ GGCAGACCTCAGCGGGGAATTGATTTGTTTAATTACC (SEQ ID NO: 86). The 

f HlHI 

h 

2 0 amplification products were combined with the kanamycin resistance gene from puc4K 

(Pharmacia) that had been digested with Hindi and gel purified away from the vector backbone. 
The annealed fragment was amplified using 5' and 3' oligos nested within the ends of the ORF 
fragment (5 5 - GCGATCGCCATTATCGCTTGG (SEQ ID NO : 8 7) and 5'- 
GCAGACTGGCAATTATCAGTAACG (SEQ ID NO:88)) 9 so that the resulting product 
2 5 contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the kanamycin 
resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out. This PCR 
product was then cloned into the vector pGemT easy (Promega) to create the construct 
pMON21680 and used for Synechocystis transformation. 

30 4B. Transformation of Synechocystis 
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8 

Cells of Synechocystis 6S03 were grown to a density of approximately 2x10 cells per ml 
and harvested by centrifugation. The cell pellet was re-suspended in fresh BG-1 1 medium 
(ATCC Medium 616) at a density of lxl 0 9 cells per ml and used immediately for transformation. 
One-hundred microliters of these cells were mixed with 5 ul of mini prep DNA and incubated 
with light at 30C for 4 hours. This mixture was then plated onto nylon filters resting on BG-1 1 
agar supplemented with TES pH8 and allowed to grow for 12-18 hours. The filters were then 
transferred to BG-1 1 agar + TES + 5ug/ml kanamycin and allowed to grow until colonies 
appeared within 7-10 days (Packer and Glazer, 1988). Colonies were then picked into BG-1 1 
liquid media containing 5 ug/ml kanamycin and allowed to grow for 5 days. These cells were 
then transferred to Bg-1 1 media containing lOug/ml kanamycin and allowed to grow for 5 days 
and then transferred to Bg-1 1 + kanamycin at 25ug/ml and allowed to grow for 5 days. Cells 
were then harvested for PCR analysis to determine the presence of a disrupted ORF and also for 
HPLC analysis to determine if the disruption had any effect on tocopherol levels. 

PCR analysis of the Synechocystis isolates for slrl736 and sill 899 showed complete 
segregation of the mutant genome, meaning no copies of the wild type genome could be detected 
in these strains. This suggests that function of the native gene is not essential for cell function. 
HPLC analysis of these same isolates showed that the sill 899 strain had no detectable reduction 
in tocopherol levels. However, the strain carrying the knockout for sir 1736 produced no 
detectable levels of tocopherol. 

The amino acid sequences for the Synechocystis knockouts are compared using ClustalW, 
and are provided in Table 3 below. Provided are the percent identities, percent similarity, and the 
percent gap. The alignment of the sequences is provided in Figure 21. 



Table 3: 



Slrl736 


slr0926 


sill 899 


slr0056 


slrl518 


sir 1736 %identity 


14 


12 


18 


11 


%similar 


29 


30 


34 


26 


%gap 


8 


7 


10 


5 


slr0926 %identity 




20 


19 


14 


%similar 




39 


32 


28 



36 
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%gap 


7 9 


4 


sill 899 %identity 


17 


13 


%similar 


29 


29 


%gap 


12 


9 


slr0056 %identity 




15 


%similar 




31 


%gap 




8 


slrl518%identity 






%similar 






%gap 







Amino acid sequence comparisons are performed using various Arabidopsis 
prenyltransferase sequences and the Synechocystis sequences. The comparisons are presented in 
Table 4 below. Provided are the percent identities, percent similarity, and the percent gap. The 
alignment of the sequences is provided in Figure 22. 



Table 4: 





ATPT2 sir 173 6 


ATPT3 


slr0926 


ATPT4 


sill 899 


ATPT12 


slr0056 


ATPT8 


slrl518 


ATPT2 


29 


9 


9 


8 


8 


12 


9 


7 


9 




46 


23 


21 


20 


20 


28 


23 


21 


20 




27 


13 


28 


23 


29 


11 


24 


25 


24 


slrl736 




9 


13 


8 


12 


13 


15 


8 


10 






19 


28 


19 


28 


26 


33 


21 


26 






34 


12 


34 


15 


26 


10 


12 


10 


ATPT3 






23 


11 


14 


13 


10 


5 


11 








36 


26 


26 


26 




14 


22 








29 


21 


31 


16 


30 


30 


30 










12 


20 


17 


20 


11 


14 


slr0926 








24 


37 


28 


33 


24 


29 










33 


12 


25 


10 


11 


9 
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18 


11 

X X 


8 


6 


7 




ATPT4 


33 


23 


18 


16 


19 

X V 






28 


19 


32 


32 


33 








13 


17 


10 


12 




sill 899 

Oil 1077 




94 




93 


96 








27 


13 


10 


ii 










52 


8 


ii 




ATPT1 






66 


19 


26 




2 




















18 


25 


23 
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13 
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slr0056 
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19 
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10 
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ATPT8 










23 
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■ . P*m**T 
> IF-F«- 


slrl518 













4C. Phytyl Preny transferase Enzyme Assays 

[ H] Homogentisic acid in 0.1% H3PO4 (specific radioactivity 40 Ci/mmol). Phytyl 
5 pyrophosphate was synthesized as described by Joo, et ah (1973) Can J, Biochem. 51 :1527. 2- 
methyl-6-phytylquinol and 2 ? 3-dimethyl-5-phytylquinol were synthesized as described by Soli, et 
al (1980) Phytochemistry 19:215. Homogentisic acid, a, (3, 8, and y-tocopherol, and tocol, were 

purchased commercially. 

The wild-type strain of Synechocystis sp. PCC 6803 was grown in BG1 1 medium with 

1 0 bubbling air at 30°C under 50 jLiE.m" .s" fluorescent light, and 70% relative humidity. The growth 

medium of slrl736 knock-out (potential PPT) strain of this organism was supplemented with 25 
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|Ltg mL" 1 kanamycin. Cells were collected from 0.25 to 1 liter culture by centrifugation at 5000 g 

for 10 min and stored at -80°C. 

Total membranes were isolated according to Zak's procedures with some modifications (Zak, 
etal (1999) Eur J. Biochem 261:311). Cells were broken on a French press. Before the French 
5 press treatment, the cells were incubated for 1 hour with lysozyme (0.5%, w/v) at 30 °C in a 
medium containing 7 mM EDTA, 5 mM NaCl and 10 mM Hepes-NaOH, pH 7.4. The 
spheroplasts were collected by centrifugation at 5000 g for 10 min and resuspended at 0.1 - 0.5 mg 
chlorophylFmL" 1 in 20 mM potassium phosphate buffer, pH 7.8. Proper amount of protease 
inhibitor cocktail and DNAase I from Boehringer Mannheim were added to the solution. French 
1 0 press treatments were performed two to three times at 1 00 MPa. After breakage, the cell 
O suspension was centrifuged for 1 0 min at 5000g to pellet unbroken cells, and this was followed by 
!;« centrifugation at 100 000 g for 1 hour to collect total membranes. The final pellet was resuspended 
j B in a buffer containing 50 mM Tris-HCL and 4 mM MgCl 2 . 

□ Chloroplast pellets were isolated from 250 g of spinach leaves obtained from local markets. 

% 5 Devined leaf sections were cut into grinding buffer (2 1 /250 g leaves) containing 2 mM EDTA, 1 

mM MgCl 2 , 1 mM MnCl 2 , 0.33 M sorbitol, 0.1% ascorbic acid, and 50 mM Hepes at pH 7.5. The 
Q leaves were homogenized for 3 sec three times in a 1-L blendor, and filtered through 4 layers of 
V Z mirocloth. The supernatant was then centrifuged at 5000g for 6 min. The chloroplast pellets were 
O resuspended in small amount of grinding buffer (Douce,^ al Methods in Chloroplast Molecular 
lo Biology, 239 (1982) 

Chloroplasts in pellets can be broken in three ways. Chloroplast pellets were first aliquoted 
in 1 mg of chlorophyll per tube, centrifuged at 6000 rpm for 2 min in microcentrifuge, and 
grinding buffer was removed. Two hundred microliters of Triton X-100 buffer (0.1% Triton X- 
100, 50 mM Tris-HCl pH 7.6 and 4 mM MgCl 2 ) or swelling buffer (10 mM Tris pH 7.6 and 4 

2 5 mM MgCl 2 ) was added to each tube and incubated for V 2 hour at 4°C. Then the broken 

chloroplast pellets were used for the assay immediately. In addition, broken chloroplasts can also 
be obtained by freezing in liquid nitrogen and stored at -80°C for 14 hour, then used for the assay. 

In some cases chloroplast pellets were further purified with 40%/ 80% percoll gradient to 
obtain intact chloroplasts. The intact chloroplasts were broken with swelling buffer, then either 

3 0 used for assay or further purified for envelope membranes with 20.5%/ 3 1.8% sucrose density 
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gradient (Sol, et al (1980) supra). The membrane fractions were centrifuged at 100 OOOg for 40 
min and resuspended in 50 mM Tris-HCl pH 7.6, 4 mM MgC^. 

Various amounts of [ H]HGA, 40 to 60 \iM unlabelled HGA with specific activity in the 
range of 0.16 to 4 Ci/mmole were mixed with a proper amount of 1M Tris-NaOH pH 10 to adjust 
5 pH to 7.6. HGA was reduced for 4 min with a trace amount of solid NaBH 4 . In addition to HGA, 
standard incubation mixture (final vol 1 mL) contained 50 mM Tris-HCl, pH 7.6, 3-5 mM MgC^, 
and 100 jiM phytyl pyrophosphate. The reaction was initiated by addition of Synechocystis total 
membranes, spinach chloroplast pellets, spinach broken chloroplasts, or spinach envelope 
membranes. The enzyme reaction was carried out for 2 hour at 23 °C or 30°C in the dark or light. 
1 0 The reaction is stopped by freezing with liquid nitrogen, and stored at -80°C or directly by 
extraction, 

u 

y A constant amount of tocol was added to each assay mixture and reaction products were 

y extracted with a 2 mL mixture of chlorofcrm/methanol (1:2, v/v) to give a monophasic solution. 
-I NaCl solution (2 mL; 0.9%) was added with vigorous shaking. This extraction procedure was 

■— ■« 

J 5 repeated three times. The organic layer containing the prenylquinones was filtered through a 20 

mp, filter, evaporated under N2, and then resuspended in 100 |uL of ethanol. 
-j The samples were mainly analyzed by Normal-Phase HPLC method (Isocratic 90% Hexane 

2 and 10% Methyl-t-butyl ether), and use a Zorbax silica column, 4.6 x 250 mm. The samples were 

Hi"" 

3 also analyzed by Reversed-Phase HPLC method (Isocratic 0.1 % H3PO4 in MeOH), and use a 
2o Vydac 201HS54 C18 column; 4.6 x 250 mm coupled with an All-tech C18 guard column. The 

amount of products were calculated based on the substrate specific radioactivity, and adjusted 
according to the % recovery based on the amount of internal standard. 

The amount of chlorophyll was determined as described in Arnon (1949) Plant Physiol 24:1. 
Amount of protein was determined by the Bradford method using gamma globulin as a standard 
2 5 (Bradford, (1976) Anal. Biochem. 72:248) 

Results of the assay demonstrate that 2-Methyl-6-Phytylplastoquinone is not produced in 
the Synechocystis slrl736 knockout preparations. The results of the phytyl prenyltransferase 
enzyme activity assay for the slrl736 knock out are presented in Figure 23. 



3 0 4D. Complementation of the slrl736 knockout with ATPT2 
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In order to determine whether ATPT2 could complement the knockout of sir 1736 in 
Synechocystis 6803, a plasmid was constructed to express the ATPT2 sequence from the TAC 
promoter. A vector, plasmid psl 1 2 1 1 ? was obtained from the lab of Dr. Himadri Pakrasi of 
Washington University, and is based on the plasmid RSF1010 which is a broad host range 
plasmid (Ng W.-O., Zentella R., Wang, Y., Taylor J-S. A., Pakrasi, H.B. 2000. phrA, the major 
photoreactivating factor in the cyanobacterium Synechocystis sp. strain PCC 6803 codes for a 
cyclobutane pyrimidine dimer specific DNA photolyase. Arch. Microbiol, (in press)). The 
ATPT2 gene was isolated from the vector pCGN10817 by PCR using the following primers. 
ATPT2nco.pr S'-CCATGGATTCGAGTAAAGTTGTCGC (SEQ ID NO:89); ATPT2ri.pr- 5'- 
GAATTCACTTCAAAAAAGGTAACAG (SEQ ID NO:90). These primers will remove 
approximately 1 12 BP from the 5' end of the ATPT2 sequence, which is thought to be the 
chloroplast transit peptide. These primers will also add an Ncol site at the 5' end and an EcoRI 
site at the 3 5 end which can be used for sub-cloning into subsequent vectors. The PCR product 
from using these primers and pCGN10817 was ligated into pGEM T easy and the resulting 
vector pMON21689 was confirmed by sequencing using the ml3forward and ml3reverse 
primers. The NcoI/EcoRI fragment from pMON21689 was then ligated with the Eagl/EcoRI 
and Eagl/Ncol fragments from psll21 1 resulting in pMON21690. The plasmid pMON21690 
was introduced into the sir 173 6 Synechocystis 6803 KO strain via conjugation. Cells of sl906 (a 
helper strain) and DH10B cells containing pMON21690 were grown to log phase (O.D. 600= 
0.4) and 1 ml was harvested by centrifugation. The cell pellets were washed twice with a sterile 
BG-1 1 solution and resuspended in 200 ul of BG-1 1 . The following was mixed in a sterile 
eppendorf tube: 50 ul SL906, 50 ul DH10B cells containing pMON21690, and 100 ul of a fresh 
culture of the slrl736 Synechocystis 6803 KO strain (O.D. 730 = 0.2-0.4). The cell mixture was 
immediately transferred to a nitrocellulose filter resting on BG-1 1 and incubated for 24 hours at 
30C and 2500 LUX(50 ue) of light. The filter was then transferred to BG-1 1 supplemented with 
lOug/ml Gentamycin and incubated as above for -5 days. When colonies appeared, they were 
picked and grown up in liquid BG-1 1 + Gentamycin 10 ug/ml. (Elhai, J. and Wolk, P. 1988. 
Conjugal transfer of DNA to Cyanobacteria. Methods in Enzymology 167, 747-54) The liquid 
cultures were then assayed for tocopherols by harvesting 1ml of culture by centrifugation, 
extracting with ethanol/pyrogallol, and HPLC separation. The sir 1736 Synechocystis 6803 KO 
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strain, did not contain any detectable tocopherols, while the slrl736 Synechocystis 6803 KO 
strain transformed with pmon21690 contained detectable alpha tocopherol. A Synechocystis 
6803 strain transformed with psll21 1 (vector control) produced alpha tocopherol as well. 

4E: Additional Evidence of Prenyltransferase Activity 

To test the hypothesis that sir 1736 or ATPT2 are sufficient as single genes to obtain 
phytyl prenyltransferase activity, both genes were expressed in SF9 cells and in yeast. When 
either slrl736 or ATPT2 were expressed in insect cells (Table 5) or in yeast, phytyl 
prenyltransferase activity was detectable in membrane preparations, whereas membrane 
preparations of the yeast vector control, or membrane preparations of insect cells did not exhibit 
phytyl prenyltransferase activity. 

Table 5: Phytyl prenyltransferase activity 



Enzyme source 


Enzyme activity 




f pmol/mg x h] 


slrl736 expressed in SF9 cells 


20 


ATPT2 expressed in SF9 cells 


6 


SF9 cell control 


<0.05 


Synechocystis 6803 


0.25 


Spinach chloroplasts 


0.20 



Example 5: Transgenic Plant Analysis 
5A. Arabidopsis 

Arabidopsis plants transformed with constructs for the sense or antisense expression of 
the ATPT proteins were analyzed by High Pressure Liquid Chromatography (HPLC) for altered 
levels of total tocopherols, as well as altered levels of specific tocopherols (alpha, beta, gamma, 
and delta tocopherol). 

i 

Extracts of leaves and seeds were prepared for HPLC as follows. For seed extracts, 10 
mg of seed was added to 1 g of microbeads (Biospec) in a sterile microfuge tube to which 500 ul 
1% pyrogallol (Sigma Chem)/ethanol was added. The mixture was shaken for 3 minutes in a 
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mini Beadbeater (Biospec) on "fast" speed. The extract was filtered through a 0.2 um filter into 
an autosampler tube. The filtered extracts were then used in HPLC analysis described below. 

Leaf extracts were prepared by mixing 30-50 mg of leaf tissue with 1 g microbeads and 
freezing in liquid nitrogen until extraction. For extraction, 500 ul 1% pyrogallol in ethanol was 
added to the leaf/bead mixture and shaken for 1 minute on a Beadbeater (Biospec) on "fast" 
speed. The resulting mixture was centrifuged for 4 minutes at 14,000 rpm and filtered as 
described above prior to HPLC analysis. 

HPLC was performed on a Zorbax silica HPLC column (4.6 mm X 250 mm) with a 
fluorescent detection, an excitation at 290 nm, an emission at 336 nm, and bandpass and slits. 
Solvent A was hexane and solvent B was methyl-t-butyl ether. The injection volume was 20 ul, 
the flow rate was 1 .5 ml/min, the run time was 12 min (40°C) using the gradient (Table 6): 



Table 6: 

Time Solvent A Solvent B 

Omin. 90% 10% 

10 min. 90% 10% 

11 min. 25% 75% 

12 min. 90% 10% 



Tocopherol standards in 1 % pyrogallol/ ethanol were also run for comparison (alpha 
tocopherol, gamma tocopherol, beta tocopherol, delta tocopherol, and tocopherol (tocol) (all from 
Matreya). 

Standard curves for alpha, beta, delta, and gamma tocopherol were calculated using 
Chemstation software. The absolute amount of component x is: Absolute amount of x= 
Response x x RF X x dilution factor where Response x is the area of peak x, RF X is the response 
factor for component x (Amount x /Response x ) and the dilution factor is 500 ul. The ng/mg tissue 
is found by: total ng component/mg plant tissue. 

Results of the HPLC analysis of seed extracts of transgenic Arabidopsis lines containing 
pMON 10822 for the expression of ATPT2 from the napin promoter are provided in Figure 24. 
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HPLC analysis results of segregating T2 Arabidopsis seed tissue expressing the ATPT2 
sequence from the napin promoter (pCGN10822) demonstrates an increased level of tocopherols 
in the seed. Total tocopherol levels are increased as much as 50% over the total tocopherol 
levels of non-transformed (wild-type) Arabidopsis plants (Figure 25). Homozygous progeny 
from the top 3 lines (T3 seed) have up to a two-fold (100%) increase in total tocopherol levels 
over control Arabidopsis seed ( Figure 26.) 

Furthermore, increases of particular tocopherols are also increased in transgenic 
Arabidopsis plants expressing the ATPT2 nucleic acid sequence from the napin promoter. 
Levels of delta tocopherol in these lines are increased greater than 3 fold over the delta 
tocopherol levels obtained from the seeds of wild type Arabidopsis lines. Levels of gamma 
tocopherol in transgenic Arabidopsis lines expressing the ATPT2 nucleic acid sequence are 
increased as much as about 60% over the levels obtained in the seeds of non-transgenic control 
lines. Furthermore, levels of alpha tocopherol are increased as much as 3 fold over those 
obtained from non-transgenic control lines. 

Results of the HPLC analysis of seed extracts of transgenic Arabidopsis lines containing 
pCGN 10803 for the expression of ATPT2 from the enhanced 35S promoter (antisense 
orientation ) are provided in Figure 25. Two lines were identified that have reduced total 
tocopherols, up to a ten-fold decrease observed in T3 seed compared to control Arabidopsis 
(Figure 27.) 

5B. Canola 

Brassica napus, variety SP30021, was transformed with pCGN 10822 (napin- ATPT2- 
napin 3\ sense orientation) using Agrobacterium tumefaciens-mediated transformation. Flowers 
of the R0 plants were tagged upon pollination and developing seed was collected at 35 and 45 
days after pollination (DAP). 

Developing seed was assayed for tocopherol levels, as described above for Arabidopsis. 
Line 10822-1 shows a 20% increase of total tocopherols, compared to the wild-type control, at 45 
DAP. Figure 28 shows total tocopherol levels measured in developing canola seed. 

Example 6: Sequences to Tocopherol Cyclase 



44 



Attorney Docket No: 
52015/01/US 

6A. Preparation of the slrl737 Knockout 

The Synechocystis sp. 6803 slrl737 knockout was constructed by the following method. 
The GPS™-1 Genome Priming System (New England Biolabs) was used to insert, by a Tn7 
Transposase system, a Kanamycin resistance cassette into slrl 737. A plasmid from a 
5 Synechocystis genomic library clone containing 652 base pairs of the targeted orf (Synechcocystis 
genome base pairs 1324051 - 1324703; the predicted orf base pairs 1323672 - 1324763, as 
annotated by Cyanobase) was used as target DNA. The reaction was performed according to the 
manufacturers protocol. The reaction mixture was then transformed into E. coli DH10B 
electrocompetant cells and plated. Colonies from this transformation were then screened for 
1 0 transposon insertions into the target sequence by amplifying with Ml 3 Forward and Reverse 
^ Universal primers, yielding a product of 652 base pairs plus -1700 base pairs, the size of the 
y transposon kanamycin cassette, for a total fragment size of -2300 base pairs. After this 
0 determination, it was then necessary to determine the approximate location of the insertion 

-* H 

^ within the targeted orf, as 100 base pairs of orf sequence was estimated as necessary for efficient 
9. 5 homologous recombination in Synechocystis. This was accomplished through amplification 
reactions using either of the primers to the ends of the transposon, Primer S (5' end) or N (3' 
-I end), in combination with either a Ml 3 Forward or Reverse primer. That is, four different primer 

f 

2 combinations were used to map each potential knockout construct: Primer S - Ml 3 Forward, 

3 Primer S - Ml 3 Reverse, Primer N - Ml 3 Forward, Primer N - Ml 3 Reverse. The construct 
5 0 used to transform Synechocystis and knockout slrl 737 was determined to consist of a 

approximately 150 base pairs of slrl 737 sequence on the 5 ? side of the transposon insertion and 
approximately 500 base pairs on the 3' side, with the transcription of the orf and kanamycin 
cassette in the same direction. The nucleic acid sequence of sir 173 7 is provided in SEQ ID 
NO:38 the deduced amino acid sequence is provided in SEQ ID NO:39. 

2 5 Cells of Synechocystis 6803 were grown to a density of - 2xl0 8 cells per ml and 

harvested by centrifugation. The cell pellet was re-suspended in fresh BG-1 1 medium at a 
density of 1 xl 0 9 cells per ml and used immediately for transformation. 1 00 ul of these cells were 
mixed with 5 ul of mini prep DNA and incubated with light at 30C for 4 hours. This mixture 
was then plated onto nylon filters resting on BG-1 1 agar supplemented with TES ph8 and 

3 0 allowed to grow for 12-18 hours. The filters were then transferred to BG-1 1 agar + TES + 
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5ug/ml kanamycin and allowed to grow until colonies appeared within 7-10 days (Packer and 
Glazer, 1988). Colonies were then picked into BG-1 1 liquid media containing 5 ug/ml 
kanamycin and allowed to grow for 5 days. These cells were then transferred to Bg-1 1 media 
containing lOug/ml kanamycin and allowed to grow for 5 days and then transferred to Bg-1 1 + 
5 kanamycin at 25ug/ml and allowed to grow for 5 days. Cells were then harvested for PCR 

analysis to determine the presence of a disrupted ORF and also for HPLC analysis to determine if 
the disruption had any effect on tocopherol levels. 

PCR analysis of the Synechocystis isolates, using primers to the ends of the slrl 737 orf , 
showed complete segregation of the mutant genome, meaning no copies of the wild type genome 
1 0 could be detected in these strains. This suggests that function of the native gene is not essential 

for cell function. HPLC analysis of the strain carrying the knockout for slrl 73 7 produced no 
:y detectable levels of tocopherol. 

. >■ *■ 

* n " 

* J * 

. ' "F«*V> 
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6B. The relation of slrl 737 and slrl 736 

■ * *"*^* 

; 5 The slrl 737 gene occurs in Synechocystis downstream and in the same orientation as 

slrl 736, the phytyl prenyltransf erase. In bacteria this proximity often indicates an operon 

• f M. «ll 

H structure and therefore an expression pattern that is linked in all genes belonging to this operon. 
V Z Occasionally such operons contain several genes that are required to constitute one enzyme. To 

r 

l!3 confirm that slrl737 is not required for phytyl prenyltransferase activity, phytyl prenyltransferase 

■ ipni 

» * 

■ ** 

2 0 was measured in extracts from the Synechocystis sir 173 7 knockout mutant. Figure 29 shows that 
extracts from the Synechocystis sir 173 7 knockout mutant still contain phytyl prenyltransferase 
activity. The molecular organization of genes in Synechocystis 6803 is shown in A. Figures B 
and C show HPLC traces (normal phase HPLC) of reaction products obtained with membrane 
preparations from Synechocystis wild type and slrl 737" membrane preparations, respectively. 

2 5 The fact that slrl 737 is not required for the PPT activity provides additional data that 

ATPT2 and slrl 736 encode phytyl prenyltransferases. 

6C Synechocystis Knockouts 

Synechocystis 6803 wild type and Synechocystis slrl 737 knockout mutant were grown 
30 photoautotrophically. Cells from a 20 ml culture of the late logarithmic growth phase were 
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harvested and extracted with ethanoL Extracts were separated by isocratic normal-phase HPLC 
using a Hexane/Methyl-t-butyl ether (95/5) and a Zorbax silica column, 4.6 x 250 mm. 
Tocopherols and tocopherol intermediates were detected by fluorescence (excitement 290 nm, 
emission 336 nm) (Figure 30). 

Extracts of Synechocystis 6803 contained a clear signal of alpha-tocopherol. 2,3- 
Dimethyl-5-phytylplastoquinol was below the limit of detection in extracts from the 
Synechocystis wild type (C). In contrast, extracts from the Synechocystis slrl737 knockout 
mutant did not contain alpha-tocopherol, but contained 2,3-dimethyl-5-phytylplastoquinol (D), 
indicating that the interruption of slrl737 has resulted in a block of the 2,3-dimethyl-5- 
phytylplastoquinol cyclase reaction. 

Chromatograms of standard compounds alpha, beta, gamma, delta-tocopherol and 2,3- 
dimethyl-5-phytylplastoquinol are shown in A and B. Chromatograms of extracts form 
Synechocystis wild type and the Synechocystis slrl737 knockout mutant are shown in C and D, 
respectively. Abbreviations: 2,3-DMPQ, 2,3-dimethyl-5-phytylplastoquinol. 

6D. Incubation with Lysozyme treated Synechocystis 

Synechocystis 6803 wild type and slrl737 knockout mutant cells from the late logarithmic 
growth phase (approximately 1 g wet cells per experiment in a total volume of 3 ml) were treated 
with Lysozyme and subsequently incubated with S-adenosylmethionine, and 
phytylpyrophosphate, plus radiolabeled homogentisic acid. After 17h incubation in the dark at 
room temperature the samples were extracted with 6 ml chloroform / methanol (1/2 v/v). Phase 
separation was obtained by the addition of 6 ml 0.9% NaCl solution. This procedure was 
repeated three times. Under these conditions 2,3-dimethyl-5-phytylplastoquinol is oxidized to 
form 2 5 3-dimethyl-5-phytylplastoquinone. 

The extracts were analyzed by normal phase and reverse phase HPLC. Using extracts 
from wild type Synechocystis cells radiolabeled gamma-tocopherol and traces of radiolabeled 
2,3-dimethyl-5-phytylplastoquinone were detected. When extracts from the slrl737 knockout 
mutant were analyzed, only radiolabeled 2,3-dimethyl-5-phytylplastoquinone was detectable. 
The amount of 2,3-dimethyl-5-phytylplastoquinone was significantly increased compared to wild 
type extracts. Heat treated samples of the wild type and the slrl737 knockout mutant did not 
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produce radiolabeled 2,3-dimethyl-5-phytylplastoquinone, nor radiolabelled tocopherols. These 
results further support the role of the slrl737 expression product in the cyclization of 2,3- 
dimethyl-5-phylylplastoquinoL 



5 6E. Arabidopsis Homologue to slrl737 

An Arabidopsis homologue to slrl737 was identified from a BLASTALL search using 
Synechocystis sp 6803 gene slrl737 as the query, in both public and proprietary databases. SEQ 
ID NO: 109 and SEQ ID NO:l 10 are the DNA and translated amino acid sequences, respectively, 
of the Arabidopsis homologue to sir 1737. The start if found at the ATG at base 56 in SEQ ID 
10 NO:109. 

n The sequences obtained for the homologue from the proprietary database differs from the 

< -ri-ii 

y public database (F4D1 1.30, BAC AL022537), in having a start site 471 base pairs upstream of 
0 the start identified in the public sequence. A comparison of the public and proprietary sequences 

I ^ IIHI 
** V 

% is provided in Figure 3 1 . The correct start correlates within the public database sequence is at 

J 5 12080, while the public sequence start is given as being at 1 1609. 

r m 

Attempts to amplify a slrl737 homologue were unsuccessful using primers designed from 

b*«VU 

■ m i <i i 

*! the public database, while amplification of the gene was accomplished with primers obtained 
" from SEQ ID NO: 109. 

1 

3 Analysis of the protein sequence to identify transit peptide sequence predicted two 

7 0 potential cleavage sites, one between amino acids 48 and 49, and the other between amino acids 
98 and 99. 



6F . sir 1 73 7 Protein Information 

The slrl737 orf comprises 363 amino acid residues and has a predicted MW of 41kDa 

2 5 (SEQ ID NO: 39). Hydropathic analysis indicates the protein is hydrophillic (Figure 32). 

The Arabidopsis homologue to sir 1737 (SEQ ID xx) comprises 488 amino acid residues, 
has a predicted MW of 55kDa, and a has a putative transit peptide sequence comprising the first 
98 amino acids. The predicted MW of the mature form of the Arabidopsis homologue is 44kDa. 
The hydropathic plot for the Arabidopsis homologue also reveals that it is hydrophillic (Figure 

3 0 33). Further blast analysis of the Arabidopsis homologue reveals limited sequence identity (25 % 
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sequence identity) with the beta-subunit of respiratory nitrate reductase. Based on the sequence 
identity to nitrate reductase, it suggests the sir 173 7 orf is an enzyme that likely involves general 
acid catalysis mechanism. 

Investigation of known enzymes involved in tocopherol metabolism indicated that the 
5 best candidate corresponding to the general acid mechanism is the tocopherol cyclase. There are 
many known examples of cyclases including, tocopherol cyclase, chalcone isomerase, lycopene 
cyclase, and aristolochene synthase. By further examination of the microscopic catalytic 
mechanism of phytoplastoquinol cyclization, as an example, chalcone isomerase has a catalytic 
mechanism most similar to tocopherol cyclase. (Figure 34). 
1 0 Multiple sequence alignment was performed between sir 1 737, slrl 737 Arabidopsis 

;s% homologue and the Arabidopsis chalcone isomerase (Genbank:P41088) (Figure 35). 65% of the 
y conserved residues among the three enzymes are strictly conserved within the known chalcone 

* -\ r 

;n isomerases. The crystal structure of alfalfa chalcone isomerase has been solved (Jez, Joseph M., 

— m 

S Bowman, Marianne E. ? Dixon, Richard A., and Noel, Joseph P. (2000) "Structure and 

ft 5 mechanism of the evolutionarily unique plant enzyme chalcone isomerase". Nature Structural 

t **** 

Biology 7: 786-791 .) It has been demonstrated tyrosine (Y) 106 of the alfalfa chalcone 

1 isomerase serves as the general acid during cyclization reaction (Genbank: P28012). The 

: j; equivalent residue in slrl 737 and the slrl 737 Arabidopsis homolog is lysine (K), which is an 
;3 excellent catalytic residue as general acid. 

"2 0 The information available from partial purification of tocopherol cyclase from Chlorella 

protothecoides (U.S. Patent No. 5,432,069), i.e., described as being glycine rich, water soluble 
and with a predicted MW of 48-5 OkDa, is consistent with the protein informatics information 
obtained for the slrl 737 and the Arabidopsis slrl 737 homologue. 

2 5 All publications and patent applications mentioned in this specification are indicative of 

the level of skill of those skilled in the art to which this invention pertains. All publications and 
patent applications are herein incorporated by reference to the same extent as if each individual 
publication or patent application was specifically and individually indicated to be incorporated by 
reference. 
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Although the foregoing invention has been described in some detail by way of illustration 
and example for purposes of clarity of understanding, it will be obvious that certain changes and 
modifications may be practiced within the scope of the appended claim. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid sequence encoding a tocopherol cyclase. 

2. An isolated nucleic acid sequence according to Claim 1, wherein said tocopherol 
cyclase is active in the cyclization of 2,3-dimethyl-5-phytylplastoquinol to tocopherol. 

3. An isolated nucleic acid sequence according to Claim 1, wherein said tocopherol 
cyclase is active in the cyclization of 2,3-dimethyl-5-geranylgeranylplastoquinol to tocotrienol 

4. An isolated DNA sequence according to Claim 1, wherein said nucleic acid sequence is 
isolated from a eukaryotic cell source. 

5. An isolated DNA sequence according to Claim 4, wherein said eukaryotic cell source is 
selected from the group consisting of mammalian, nematode, fungal, and plant cells. 

6. The DNA encoding sequence of Claim 5 wherein said tocopherol cyclase protein is from 
Arabidopsis. 

7. The DNA encoding sequence of Claim 6 wherein said tocopherol cyclase protein is encoded 
by a sequence of SEQ ID NO: 109. 

8. The DNA encoding sequence of Claim 7 wherein said tocopherol cyclase protein has an 
amino acid sequence of SEQ ID NO:l 10. 

9. The DNA encoding sequence of Claim 4 wherein said tocopherol cyclase protein is from a 
source selected from the group consisting of Arabidopsis, soybean, corn, rice, wheat, leek canola, , leek, 
cotton, and tomato. 

10. An isolated DNA sequence according to Claim 4, wherein said prokaryotic source is a 
Synechocystis sp. 

1 1 . The DNA encoding sequence of Claim 1 0 wherein said tocopherol cyclase protein is 
encoded by a sequence of SEQ ID NO:38. 

12. The DNA encoding sequence of Claim 10 wherein said tocopherol cyclase protein has an 
amino acid sequence of SEQ ID NO:39. 

13. A nucleic acid construct comprising as operably linked components, a transcriptional 
initiation region functional in a host cell, a nucleic acid sequence encoding a tocopherol cyclase, and a 
transcriptional termination region. 
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14. A nucleic acid construct according to Claim 13, wherein said nucleic acid sequence encoding 
tocopherol cyclase is obtained from an organism selected from the group consisting of a eukaryotic 
organism and a prokaryotic organism. 

15. A nucleic acid construct according to Claim 14, wherein said nucleic acid sequence encoding 
tocopherol cyclase is obtained from a plant source. 

16. A nucleic acid construct according to Claim 15, wherein said nucleic acid sequence encoding 
tocopherol cyclase is obtained from a source selected from the group consisting of Arabidopsis, soybean, 
corn, rice, wheat, leek canola, , leek, cotton, and tomato. 

17. A nucleic acid construct according to Claim 13, wherein said nucleic acid sequence encoding 
tocopherol cyclase is obtained from a Synechocystis sp. 

18. A plant cell comprising the construct of 13. 

19. A plant comprising a cell of Claim 18. 

20 A feed composition produced from a plant according to Claim 19, 
21 . A seed comprising a cell of Claim 1 8. 
22 Oil obtained from a seed of Claim 2 1 . 

23. A natural tocopherol rich refined and deodorised oil which has been produced by a 
method of treating an oil according to Claim 22 by distilling under low pressure and high 
temperature, wherein said refined oil has reduced free fatty acids and a substantial percentage of 
tocopherol present in the pretreated oil. 

24. A refined oil according to claim 23, wherein the pretreated oil is crude or pre-treated 
soybean oil. 

25. A refined oil according to claim 23, wherein the refined oil is degummed and 
bleached. 

26. A method for the alteration of the isoprenoid content in a host cell, said method comprising; 
transforming said host cell with a construct comprising as operably linked components, a transcriptional 
initiation region functional in a host cell, a nucleic acid sequence encoding tocopherol cyclase, and a 
transcriptional termination region, 

wherein said isoprenoid compound selected from the group of tocopherols and tocotrienols . 

27. The method according to Claim 26, wherein said host cell is selected from the group 
consisting of a prokaryotic cell and a eukaryotic cell. 
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28. The method according to Claim 27, wherein said prokaryotic cell is a Synechocystis sp. 

29. The method according to Claim 27, wherein said eukaryotic cell is a plant cell 

30. The method according to Claim 29, wherein said plant cell is obtained from a plant selected 
from the group consisting of Arabidopsis, soybean, corn, rice, wheat, leek canola, , leek, cotton, and 

5 tomato. 

3 1 . A method for producing an isoprenoid compound of interest in a host cell, said method 
comprising obtaining a transformed host cell, said host cell having and expressing in its genome: 

a construct having a DNA sequence encoding a tocopherol cyclase operably linked to a 
transcriptional initiation region functional in a host cell, 
1 0 wherein said isoprenoid compound selected from the group of tocopherols and tocotrienols. 

; «i 32. The method according to Claim 3 1 , wherein said host cell is selected from the group 

: y consisting of a prokaryotic cell and a eukaryotic cell. 

r -r 

T T>* ^ 

i;0 33. The method according to Claim 32, wherein said prokaryotic cell is a Synechocystis sp. 

:4 34. The method according to Claim 32, wherein said eukaryotic cell is a plant cell. 

% 5 35. The method according to Claim 34, wherein said plant cell is obtained from a plant selected 

from the group consisting wherein said compound selected from the group of Arabidopsis, soybean, 
j<« 5 corn, rice, wheat, leek canola, , leek, cotton, and tomato. 

\: 36. A method for increasing the biosynthetic flux in a host cell toward production of an 

3 isoprenoid compound, said method comprising; 

"2 0 transforming said host cell with a construct comprising as operably linked components, a 

transcriptional initiation region functional in a host cell, a DNA encoding a tocopherol cyclase, 
and a transcriptional termination region, 

wherein said isoprenoid compound selected from the group of tocopherols and 
tocotrienols,. 

2 5 37. The method according to Claim 36, wherein said host cell is selected from the group 

consisting of a prokaryotic cell and a eukaryotic cell. 

38. The method according to Claim 37, wherein said prokaryotic cell is a Synechocystis sp. 

39. The method according to Claim 37, wherein said eukaryotic cell is a plant cell. 
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40. The method according to Claim 39, wherein said plant cell is obtained from a plant selected 
from the group consisting Arabidopsis, soybean, corn, rice, wheat, leek canola, , leek, cotton, and 
tomato. 

41 . The method according to Claim 39, wherein said transcriptional initiation region is a seed- 
5 specific promoter. 
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NUCLEIC ACID SEQUENCES TO PROTEINS INVOLVED IN TOCOPHEROL 

SYNTHESIS 

ABSTRACT 

Nucleic acid sequences and methods are provided for producing plants and seeds having 
altered tocopherol content and compositions. The methods find particular use in increasing the 
tocopherol levels in plants, and in providing desirable tocopherol compositions in a host plant 
cell. 
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9428 



Figure 8 



Pvuil 121 
EcoRI 296 
Sad 306 
Saci 645 

EcoRV 1549 

Xbai 1578 



EcoRV 17530 
Bglli 



17334 



EcoRI 1773 
Sail 2075 
Pstl 2085 
Psti 2168 
Pvull 2191 
Bglll 2257 
Hindlll 2322 
Pstl 2487 
Psti 2659 
Dral 2825 
Hindiil 2922 
Hindlil 2972 
Not! 3316 




Sac! 15781 
EcoRV 15683 

Ciai 15669 
Pvul 15669 

Pvui 15097 

Xhol 14454 

Pvui 14023 
Dral 13868 
Sac! 13749 
Pvull 13739 
Bglll 13467 
Xhol 12856 
Pvul 12348 
Pvull 12213 
Saci 12101 
Bglll 12078 
Smal 12063 
EcoRI 11733 
EcoRI 11585 
Pvul 11445 
Pvul! 11196 f 
Ncol 11102 
EcoRV 11046 

Sail 10409 
Pvul 10374 
Xhol 10344 
Bglll 10288 
EcoRV 10157 
Pvul 9958 



BamHl 3323 
Xhol 3328 
BamHl 3645 
EcoRV 4489 
Pvuil 4688 
Pvul 4719 
Xhol 4768 
Pvull 5004 
EcoRV 5299 
Sail 5509 
EcoRi 5521 
Pvuil 5776 
Ncol 6102 
EcoRi 6511 
Smai 6518 



Xhol 7660 
Pvul 7670 
Dral 8122 
Pvui 8206 
Xhol 8651 
Smal 8815 
EcoRV 8846 
EcoRV 9018 
EcoRV 9308 



Figure 9 



Pvuli 121 
EcoRi 296 
Saci 306 
Sad 645 

EcoRV 



1549 

Xbal 1578 
EcoRi 1773 
BamHi 2080 
20B7 
Hindlll 2431 
Hindlll 2481 
Dral 2582 
2752 
2924 



EcoRV 17530 v 
Bgli! 17334-x v 



Saci 15781 
EcoRV 15683 

Cla! 15689 
Pvul 15669 



Pvul 15097 — E 




Xhol 14454 

Pvul 14023 
Dral 13868 
Saci 13749 j 
Pvul I 13739 ' 
Bglli 13467 
Xhol 12856 
Pvul 12348 
Pvuli 12213 
Saci 12101 
Bglft 12078 
Smal 12063 
EcoRi 11733 
EcoRi 11585 
Pvul 11445 
Pvuli 11196 
Ncol 11102 
EcoRV 11046 

Sail 10409 
Pvul 10374 
Xhol 10344 
Bglll 10288 
EcoRV 101.57 
Pvul 9958 



Hindlll 3081 
Bglll 3146 
>vull 3216 
=>stl 3243 
Pstl 3326 

BamHI 3645 
EcoRV 4489 
h Pvuli 4688 
Hi Pvul 4719 
Xhol 4768 
-Pvuli 5004 
EcoRV 5299 
Sail 5509 
EcoRi 5521 
Pvuli 5776 
Ncol 6102 
EcoRi 6511 
Smai 6518 

7660 
7670 



EcoRV 
EcoRV 9018 
EcoRV 9308 



Figure 10 



Pvull 121 
EcoRi 296 
Saci 306 
Ncoi 346 
Still 486 
Ncoi 736 

EcoRV 1444 



Sacl 19291 - 

up 



EcoRV 17014 v 
Bgill 16818 -\ x 



Sacl 15265 
EcoRV 15167 

. Cia! 15153 



Pvul 15153 - 



Pvul 14581 



Xhol 13938 



aadA 

aacCi 




= Spc/Str 




RB 
polylinker 

d35S 

urbo polylinker 

ATPT3 
Turbo polylinker 



Pstl 1670 
Pstl 1753 
Pvull 1776 
Bglil 1842 
Hindlll 1907 
Pstl 2072 
Pstl 2244 
Dral 2410 
-lindlll 2507 
Hindlll 2557 
foil 2901 
^amHI 2908 
Bad 2925 



pCGN10812 
19314 bp 



35S promoter ^ 

XT \jtr f**/~S fx 




Pvul 13507 
Dral 13352 
Sacl 13233 
Pvull 13223 
Bgill 12951 
Xhol 12340 
Pvul 11832 
Pvull 11697 
Sacl 11585 
Bgill 11562 
Smal 11547 
EcoRI 11217 
EcoRI 11069 
Pvul 10929 
Pvull 10680 
Ncoi 10586 
EcoRV 10530 

Sail 9893 
Pvul 9858 
Xhol 9828 
Bgill 9772 



Pvull 4172 
Pvul 4203 
Xhol 4252 
Pvull 4488 

EcoRV 4783 
TSall 4993 
EcoRI 5005 
Pvull 5260 
Ncoi 5586 
EcoRI 5995 
Smal 6002 



Xhol 
Pvul 
Dral 7606 
Pvul 7690 
Xhol 8135 
Smal 8299 
EcoRV 8330 
EcoRV 8502 
EcoRV 8792 

9641 



7144 
7154 



Figure 1 1 



Pvull 121 
EcoRi 296 
Saci 306 
Ncol 346 
Stui 488 
Ncoi 736 

EcoRV 1444 



EcoRV 17072 
Bglli 16876- 

Saci 15323 
EcoRV 15225 

Cla! 1521 
Pvul 15211 

Pvul 14639 

Xhoi 13996 

Pvul 13565 
Oral 13410 
Saci 13291 
Pvull 13281 
Bglli 13009 
Xhol 12398 
Pvul 11890 
Pvull 11755 
Saci 11643 
Bglli 11620 
Smal 11605 
EcoRi 11275 
EcoRi 11127 
Pvul 10987 
Pvull 10738 
Ncol 10644 
EcoRV 10588 

Sail 9951 
Pvul 9916 
Xhol 9886 
Bglli 9830 
EcoRV 9699 



BamHI 1859 
Not! 1866 
Bglli 1685 
Bglli 1844 
Xbal 191 
Smal 2064 
Pvul! 2450 
Pvull 2519 
Pst! 2977 
Saci 2983 




Pvull 4230 
Pvul 4261 
Xhol 4310 
Pvull 4546 
EcoRV 4841 
Sail 5051 
EcoRi 5063 
Pvull 5318 
Ncoi 5644 
EcoRi 6053 
Smal 6060 



Xhol 7202 
Pvul 7212 
Dral 7664 
Pvul 7748 
Xhol 8193 
Smal 8357 
EcoRV 8388 
EcoRV 8560 
EcoRV 8850 
9500 



Figure 12 



Pvuil 121 
EcoRI 296 
Sac! 306 
Sacl 645 

EcoRV 1549 



EcoRV 17602 
Bglll 17406 

Sacl 15853 
EcoRV 15755 

Cla! 15741 
Pvul 15741 

Pvul 15169 

Xhol 14526 

Pvul 14095 
Dral 13940 
Sacl 13821 
Pvuil 13811 
Bglll 13539 
Xhol 12928 
Pvul 12420 
Pvul 
Saci 12' 
Bglll 12 
Smal 12 
EcoRI 
EcoRi 




Xbal 1578 
EcoRI 1773 
Sail 2075 
PstI 2085 
Pvuli 2539 
Pvuil 2608 
Smal 2994 
Xbal 3135 



Spc/Str 



pCGN10814 
1 9902 bp 




35S promoter 




Bglll 3210 
Bglll 3369 
Not! 3388 
BamHI 3395 
Xhol 3400 
BamHI 3717 
EcoRV 4561 
h Pvul I 4760 
i— 7/ Pvul 4791 
Xhol 4840 
Pvuil 5076 
EcoRV 5371 
Sail 5581 
EcoRI 5593 
Pvuil 5848 
Ncol 6174 
EcoRI 6583 
Smal 6590 



7732 
7742 



EcoRV 8918 
EcoRV 9090 
EcoRV 9380 



EcoRV 



Xhol 10416 
Bglll 10360 
EcoRV 10229 
Pvul 10030 



Figure 13 



Pvuil 121 
EcoRl 296 
Saci 306 
Sad 645 

EcoRV 1549 



Saci 15853 
EcoRV 15755 

Clal 15741 
Pvul 15741 

Pvui 15169 
Xhol 14526 




EcoRV 17602 
Bglll 17406 -\ 



Xbai 1578 
EcoRl 1773 
BamHI 2080 
Not! 2087 
Bglll 2106 
Bglll 2265 
Xbal 2340 
Smal 2485 
Pvul I 2871 
Pvul I 2940 



Spc/Sf 




pCGN10815 
19902 bp 



polylinker 
35S promote 



f^Xhol 



Pvul 14095 
Oral 13940 
Saci 13821 ' 
Pvuil 13811 1 
Bglll 13539 
Xhol 12928 
Pvul 12420 
Pvuil 12285 1 
Sad 12173 
Bglll 12150 
Smal 12135 
EcoRl 11805 
EcoRl 11657 
Pvul 11517 
Pvuil 11268 
Ncol 11174 
EcoRV 11118 

Sail 10481 
Pvul 10446 
Xhol 10416 
Bglll 10360 
EcoRV 10229 
Pvul 10030 




Psti 3398 
BamHI 3717 
EcoRV 4561 
u Pvuil 4760 
HJPvul 4791 
4840 
Pvuil 5076 
EcoRV 5371 
Sail 5581 
EcoRl 5593 
Pvuil 5848 
Ncol 6174 
EcoRl 6583 
Smal 6590 



EcoRV 
EcoRV 9380 



Figure 14 



Pvull 121 
EcoRl 296 
Sac! 306 
Ncoi 346 
Stui 488 
Ncol 736 

EcoRV 1444 



Sad 19363 



EcoRV 17086 
Bgill 16890 



Saci 15337 
EcoRV 15239 

Pvul 15225 
Pvui 14653 

Xhol 14010 




aadA 

33C01 



z Spc/Str 



pCGN10816 
19386 bp 





RB 
oolylinker 

d35S 

Turbo pel 1 

Turbo 



Pstl 1670 

Pvull 2124 
Pvull 2193 
Smal 2579 

Xbal 2720 
2795 
II 2954 

il 2973 
TiHI 2980 
2997 



v nKer 
35S promoter 

Tn5 Ran gene 



Pvul 13579 
Drai 13424 
Sad 13305 
Pvull 13295 
Bgill 13023 
Xhol 12412 

Pvul 11904 1 
Pvull 11769 
Saci 11657 
Bgill 11634 
Smal 11619 
EcoRl 11289 
EcoRl 11141 
Pvul 11001 
Pvull 10752 
Ncol 10658 
EcoRV 10602 

Sail 9965 
Pvul 9930 
Xhol 9900 
Bgill 9844 
EcoRV 9713 




Pvull 4244 
Pvul 4275 
Xhol 4324 
Pvull 4560 

EcoRV 4855 
— rSali 5065 
pv EcoRl 5077 
- Tvull 5332 
Ncol 5658 
EcoRl 6067 
Smal 6074 



Pvul 



Xhol 7216 
Pvul 7226 
Dral 7678 
Pvul 7762 
Xhol 8207 
Smal 8371 
EcoRV 8402 
EcoRV 8574 
EcoRV 8864 
9514 



Figure 15 



EcoRI 5107 
BamHl 5076 

Sad 5074 
Kpnl 5068 
Hindi!! 5058 \\ 
Pvull 4879 -\ I' 



BamHi 1 
Notl 8 

/Stui 179 
/ rStuI 332 




Stui 824 
EcoRV 896 
Xhol 1090 
Pstl 1205 
BamHl 1206 
EcoRI 1217 
Pstl 1226 
EcoRV 1229 
Notl 1244 
Xhol 1250 
Xbai 1282 

Pvull 1392 
Pvul 1423 



Pvull 2086 

2175 



Pstl 2393 
Pvull 2446 



Figure 16 



Pvull 121 
EcoRi 296 
Sad 306 
Saci 645 



Sac! 19549 



EcoRV 17272 
Bglll 17076 -\ 



Saci 15523 
EcoRV 15425 

Clal 15411 
Pvul 15411 

Pvul 14839 — H 



Xhol 14196 





EcoRV 1549 

Xbai 1578 
EcoRi 1773 
BamHI 2080 
Not! 208' 
H'mdlll 2118 
Smal 2436 
Dral 2461 
Ncoi 2524 
i Pvull 2617 
col 2780 
col 2812 
vlcol 2935 



pofylinke 
Napin Promoter 



Turbo 



pCGN10819 
1 9572 bp 



polylinker 
35S promoter 



Xhol 
Pvul 
Dral 7864 
Pvul 7948 
Xhol 8393 
Smal 8557 
EcoRV 8588 
EcoRV 8760 
EcoRV 9050 
9700 



7402 
7412 




Pvul 13765 
Dral 13610 
Saci 13491 
Pvull 13481 
Bglll 13209 
Xhol 12598 
Pvul 12090 
Pvull 11955 
Saci 11843 
Bglll 11820 
Smal 11805 
EcoRi 11475 
EcoRi 11327 
Pvul 11187 
Pvull 10938 
Ncol 10844 
EcoRV 10788 

Sail 10151 
Pvul 10116 
Xhol 10086 
Bglll 10^30 
EcoRV 9899 



2961 
3068 
BamHI 3387 
EcoRV 4231 
Pvull 4430 
Pvul 4461 
Xhol 4510 
Pvull 4746 

EcoRV 5041 
tTTSall 5251 
^EcoRI 5263 
Pvull 5518 
Ncol 5844 
EcoRi 6253 
Smal 6260 



Figure 17 



t 



Pvufl 121 
EcoRl 296 
Sac! 306 
Ncol 346 

Stul 488 
Ncol 736 

EcoRV 1444 



EcoRV 16940 s 
Bglll 16744 ~\\ 



Saci 15191 
EcoRV 15093 

Ciai 15079 
Pvul 15079 

Pvul 14507 

Xhol 13864 

Pvul 13433 
Dral 13278 / 
Sacl 13159 
Pvull 13149 
Bglil 12877 
Xhol 12266 
Pvul 11758 
Pvuli 11623 
Sacl 11511 
Bglll 11488 
Smal 11473 
EcoRl 11143 
EcoRl 10995 
Pvui 10855 
Pvull 10606 
Ncol 10512 
EcoRV 10456 

Sail 9819 
Pvul 9784 
Xhol 9754 
Bglll 9698 



BaoiHI 1859 
.oti 1686 
Pvul 1735 
EcoRl 1759 
Smal 1825 
EcoRV 2158 
col 2426 
EcoRV 2654 
2731 
2845 
2851 




Pvull 4098 
Pvul 4129 
Xhol 4178 
Pvull 4414 
EcoRV 4709 
Sail 4919 
EcoRl 4931 
Pvull 5186 
Ncol 5512 
EcoRl 5921 
Smal 5928 



Pvul 
EcoRV 9567 



Xhol 
Pvul 
Dral 7532 
Pvul 7616 
Xhol 8061 
Smal 8225 
EcoRV 8256 
EcoRV 8428 
EcoRV 8718 
9368 



7070 
7080 



Figure 18 



Pvull 121 

EcoRl 296 

Sact 306 

Saci 645 

EcoRV 1549 



Sad 19747 i 



EcoRV 17470 
Bglll 17274 ^ 



Sad 15721 
EcoRV 15623 

Cial 15609 
Pvul 15609 



Pvul 15037 





poiyiinker 
Napin Promoter 

Turbo hnker 

ATPT12 * 
Turbo linker 

Napin 3' 



Xhol 14394 




pCGN10825 
1 9770 bp 



EcoRl 1773 
BamHl 2080 
Not! 2087 
Pvul 2156 
EcoRl 2180 
Smal 2246 
EcoRV 2579 
Ncol 2847 
EcoRV 3075 
Dral 3152 
Psff 3268 
BamHl 3585 

EcoRV 4429 
Pvull 4628 
Pvul 4659 
Xhol 4708 
Pvull 4944 



polylinke 



35S promoter ^|_ EcoRV ^ 

Sail 5449 



Pvul 13963 
Dral 13808 
Sad 13689 
Pvul! 13679 
Bglll 13407 
Xhol 12796 
Pvul 12288 
Pvull 12153 
Sacl 12041 
Bglll 12018 
Smal 12003 
EcoRl 11673 ' 
EcoRl 11525 
Pvul 11385 
Pvull 11136 
Ncol 11042 
EcoRV 10986 1 

Sail 10349 
Pvul 10314 
Xhol 10284 
Bglll 10228 
EcoRV 10097 
Pvul 9898 



n5 kan gene 




EcoRl 5461 
Pvull 5716 
Ncol 6042 
EcoRl 6451 
Smal 6458 



Xhol 7600 
Pvul 7610 
Dral 8062 
Pvul 8146 
Xhol 8591 
Smal 8755 
EcoRV 8786 
EcoRV 8958 
EcoRV 9248 



Figure 19 



Pvuil 121 
EcoRl 296 
Saci 306 
Ncol 346 
StuI 486 
Ncol 736 

EcoRV 1444 



Sad 19019 



EcoRV 16742 
Bglll 16546 -\ 



Sad 14993 
EcoRV 14895 

Clal 14881 
Pvul 14881 



Pvul 14309 — IE 



Xhol 13666 

Pvul 13235 
Dral 13080 
Saci 12961 
Pvul I 12951 
Bglll 12679 
Xhol 12068 
Pvui 11560 
Pvuil 11425 
Saci 11313 
Bglll 11290 
Sma! 11275 
EcoRi 10945 
EcoRl 10797 
Pvul 10657 
Pvuil 10408 
Ncol 10314 
EcoRV 10258 ; 

Sail 
Pvui 
Xhol 



O 

poiylinker 

d35S 

Turbo poly 

Turbo 



pCGN10826 
1 9042 bp 





BarnHl 1659 
Not! 1666 
Bind! ii 1697 
Srhai 2015 
Dral 2040 
Ncol 2103 
Pvuil 2196 
Ncol 2359 
Ncol 2391 
2514 
2540 
2647 
2653 



35S promoter ^ 

n5 kan gene 



tmi 3 s 




Pvuil 3900 
Pvui 3931 
Xhol 3980 
Pvul! 4216 
EcoRV 4511 
Sail 4721 
fcH- 4 - EcoRl 4733 
Pvul! 4988 

Ncol 5314 

EcoRl 5723 
Smal 5730 



Xhoi 6872 
Pvui 6882 
Dral 7334 
Pvul 7418 
Xhol 7863 
Smal 8027 
EcoRV 8058 
EcoRV 8230 
EcoRV 8520 
Pvul 9170 
EcoRV 9369 
Bglil 9500 



9621 
9586 
9556 



Figure 20 
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Plant line number 



Figure 24 



> Plant line number 



Figure 25 
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Query Sequence; F4D11 AL022537 

Database: PIRJT04448 . atcea. list . fasta 

Database; PIFJT04448 
Plus (+) denotes forward strand, and minus (-) reverse strand 
Asterisks (*) denote bases not shown on pair wise alignments. 

Alignment 1 



Query- 



ATCEA4C371+ 



1219 4 CACACGTTCTCGTCCTTTTCTTCTTCCTCTCTGCATTCTTCACAGAGTTTGTCACCACCA 



1 



C ss 



o *- 



Met 

Query- 
ATCEA4C37 i+ 



12134 




11 I I 1 E ! i 



I M i i 11 I I I 1 i 1 I I I I i 1 I 1 I 1 1 H I I I 1 1 i I M I I i I I i I ! M I I I 



2 ACCCCAAACATCACAATTTCACATTCTTTTGCATATTTCTTCTTCTTCTTCCATTATGGA 



Query- 



&TCEA4C371- 



1207 5 GATACGGAGCTTGATTGTTTCTATGAACCCTAATTTATCTTCCTTTGAGCTCTCTCGCCC 

! I i I M i If I ! J 1 I I I I i J ) 1 I I ! I I t I ! [ I 1 I ! I I I I M I i 1 i I i I i II M M I ! I S i I 

62 GATACGGAGCTTGATTGTTTCTATGAACCCTAATTTATCTTCCTTTGAGCTCTCTCGCCC 



Query- 



ATCEA4C371 



12 015 TGTATCTCCTCTCACTCGCTCACTAGTTCCGTTCCGATCGACTAAACTAGTTCCCCGCTC 

I f II I i i I 1 I 1 I 11 i 1 M i M ; ! f 1 ] I I I N ! ! ! H i I j I i i ! j I 1 i I I I 1 t I M I I I I! 

122 TGTATCTCCTCTCACTCGCTCACTAGTTCCGTTCCGATCGACTAAACTAGTTCCCCGCTC 



Query 



ATCEA4C371+ 



11955 catttctagggtttcgBHH^tctccaccccgaatagtgaaactgacaagatctccgt 

! U M I M \ \ \ \ \ ! ! I ! ( I M i I I I ! i i i I I ( i I ! I f I M I 1 M i M ! M M M I I II i I 

182 CATTTCTAGGGTTTCGGCGTCGATCTCCACCCCGA&TAGTGAAACTGACAAGATCTCCGT 



Query- 



ATCEA4C371+ 



11895 TAAACCTGrTTACGTCCCGACGTCTCCCAATCGCGAACTCCGGACTCCfClCA^GGGTA 
M M I M M I III II I I i M I i i I I [ ! [ I I ! | j | I [ i! | [ | M | f [ m ff | '[ j | | | — 
242 TAAACCTGTTTACGTCCCGACGTCTCCCAATCGCGAACTCCGGACTCpS^C^G^GG 



Synecho sec alicrns froia 



Query- 
ATCEA4C37 1+ 



Query- 

ATCEA4C371+ 

PIR:T04448 

Query- 

ATCEA4C371+ 

PIR:?04448 



Query 



ATCEA4C371+ 



P1R:T04448 



Query' 



ATCSA4C371+ 



PIR:T04448 



Query- 



11835 AATTGATCCATTCCATTCCATTTCTCTTCTCTTGTTTGTTTTATTAAGCTCCAATTTCAG 

299 

~~ 6G bp removed — 

* + * 

: . : 

11715 *********************** *********** V ****^*±**±**±******** TTTG 
1 

11 655 GTGGCTCACCATTCGACGACTACTTTTGAATTTGAGTTTTTGAAAAATGCAATTTAACAT 

" * "*™ ***** ™ ' **- "~ — ' — — ~ " — -~ — — — — * »" — vs— i . * • * » * • +j * + « 

1 M Q F N I 

drab seaueoce m i er> ib inx:ortec 

115S5 CAGAGAGTTTTTTTTTTTATGGTTGATAACTTATTGTTTAACTTTTGAAAAATGCAGA^A 

299 ill 
r y A^A 

6REFFFLWLITYCLTFEKCRY 
11535 CCATTTCGATGGAACACCTCGGAAGTTCTTCGAGGGATGGTATTTCAGGGTTTCCATCCC 

_ i M I i I ( M [ ! I If I [ I I ! [ N I M |[ li I I 1 I ; I i i i M I ! M M I j I I 1 f I I I ! I f j ! 

302 CCATTTCGATGGAACACCTCGGAAGTTCTTCGAGGGATGGTATTTC^^^^TCCATCCC 
26HFDGTPRKFFEGWYF HB SIP 

: . : 

11475 AGAGAAGAGGGAGAGTTTTTGTTTTATGTATTCTGTGGAGAATCCTGCATTTCGGCAGAG 



Figure 31 
1/5 



I ! I I i I 1 I I M I i I i I 1 [ I I I I I I I i i II I i I I i M M M t I i i i I I I i ! 1 h M 1 M 1 I 

362 AG AGA AG AGGG AG AGT TTTTGTTTTAT GT AT T CT G T GG AGAAT C CT GC AT T T C GGC AG AG 



* * » » 4 



ATCEA4C371+ 

PIR:T04448 46EKRESFCFMYSVENPAFRQS 



Query- 
ATCEA4C371+ 



PIR:T04448 



11415 TTTGTCACCATTGGAAGTGGCTCTATATGGACCTAGATTCACTGGTGTTGGAGCTCAGAT 

] i M I I I I I I i I I 1 I M 1 I I 1 i I I M 1 1 I I [ t 1 1 I I I I I M I I I i i 1 M i I I I ! f I i I i f 

422 TTTGTCACCATTGGAAGTGGCTCTATATGGACCTAGATTCACTGGTGTTGGAGCTCAGAT 
66L S P L E V A L YG P RFT GVGAQ I 



Query' 



PIR:T04448 
ATCEA4C371+ 



11355 TCTTGGCGCTAATGATAAATATTTATGCCAATACGAACAAGACTCTCACAATTTCTGGGG 



I ! 



ATCEA4C371+ 482 TCTTGGCGCTAATGATAAATATTTATGCCAATACGAACAAGACTCTCACAATTTC 



86LGAWDKYLCQYEQDSHNFWG 
Exon 11538 11301 Confidence: 100 100 



Query- 
ATCEA4C371+ 

PIR:T04448 
PIR:T04448 



112 95 AG GT A AC T CC T T G A C C C T T A AAAT GCTGTGTCAT G AC A AT A AG A AAT CAT AT CT G AGT C T 



537 



106 D 

Exon 



11609 11294 Confxdence: 100 100 



Query- 
PIR:T04448 



11235 TTTCTCTACTTCTAGTACTAATGTTCGTTATTGTTGTTAAAGATCTAAGTCTTATCTGAA 



107 



Query- 
PIR:T04448 



11175 TTTTGTTACATTTTGGTTCTGGTGCTTTCTCAACATGAATTTGTATATATGACTTTAAAG 



107 



Query- 
PIR:T04448 



11115 ATTGCTTACCTAAAGTTTTTACTCATGCATAGATCGACATGAGCTAGTTTTGGGGAATAC 



107 



RHELVLGNT 



Query- 

PIR:T04448 
PIR:T04448 



11055 TTTTAGTGCTGTGCCAGGCGCAAAGGCTCCAAACAAGGAGGTTCCACCAGAGGTTCTCAC 

116FSAVPGAKAPNKEVPPE 
Exon 11083 11004 Confidence: 96 100 



Query- 
PIR:T04448 



1 C 9 9 5 TCCTCCCT rGTTGGTTACTTTGTTATCTGTTAAATAGTTTTCCAATTGTATCCGGATAGT 



133 



Query- 
PIR:T04448 



10935 GTTCTACTTCTCCTTGTAGAAAATCTCAAGTTTTTGTTACTCTTGCTATTCTCTTGGAT 



133 



Query- 
PIR:T04448 



1087 5 TTGATTTGTAAAGCATGTCGTTTTATTGTAGGAATTTAACAGAAGAGTGTCCGAAGGGTT 



133 



EFNRRVSEGF 



Query- 

PIR:T04448 
?IR:T04448 



10815 CCAAGCTACTCCATTTTGGCATCAAGGTCACATTTGCGATGATGGCCGGTAATTATATGA 

143 QATPFVJHQGHI CDDGR 
Exon 10844 10768 Confidence: 100 100 



Query- 
PIR:T04448 



10755 TTCTATGCACAACAAGAA'T TCACTATATTATAAATATTGGATATTGAGTATTTTTGTTGA 



159 



Query- 
?IR:T04448 



10695 AAAT T TCT GTGTTT AAAT CTGACTTGACTTGTTTTGTC AGT ACT GACTATGCGGAAACTG 



159 



T D Y A E T V 



Figure 31 
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Query- 10635 TGAAATCTGCTCGTTGGGAGTATAGTACTCGTCCCGTTTACGGTTGGGG'TGATGTTGGGG 

?IR:T04448 166 * ' K " * S * * A R W E Y S T R P V Y G W G D V G A 

' * ■ * . ; * i * I 

Query- 10575 CCAAACAGAAGTCAACTGCAGGCTGGCCTGCAGCTTTTCCTGTATTTGAGCCTCATTGGC 

: I;;:::::::::;::::::::;:::::::::::::: r :::::::::::::::::: I : 
PIR:?04443 186 *"k"q*"k"s T A G W PA A F P V F E P H W Q 

Query- 10515 AGATATGCATGGCAGGAGGCCTTTCCACAGGTGTGAGCTTTGCTTGATTGACTTAAAGTT 



PIR:T04448 206 ICMAGGLSTG 

PIR:T04448 Exon ^ 10655 10486 Confidence: 96 100 

Que ry- 10455 AATAAATAGACGGTT AAGTTT ACTTGCCTAGT ACTAACAGAAAATT AAGAAAGAAACC AC 
P1R:T04448 216 

Query- 10395 CCTCTTTCTATCAGCAGAAACTGCTATTGTAGTTCTTATTTTTTCTCTTGTATTTGCAGG 
PIR:T04448 216 

Query- 10335 GTGGATAGAATGGGGCGGTGAAAGGTTTGAGTTTCGGGATGCACCTTCTTATTCAGAGAA 

• * • * » • * * * ir,#-,*^, "T - iiiii *** ****** 

?IR:T04448 216 WI^WGGERFEFRDAPSYSSK 

Query- 10275 GAATTGGGGTGGAGGCTTCCCAAGAAAATGGTTTTGGGTAAAACATTTCATCCTTTTGCT 

PIR:T04448 236 NWGGGFPRKWFW 

PIR:T04448 Exon 10336 10239 Confidence: 96 100 

Query- 10215 ACATTTCTTGTTGCAGACTTTAGTTAGCTAGTGGACCTGTGTATACACCCACATGTAGTA 
PIR:T04448 248 

Query- 10155 TACTTGTTTGATAGCT7TATTTGTCAATGTCTCTTTACAGGTCCAGTGTAATGTCTTTGA 
PIR:T04448 248 V Q C N V F E 

• m * * m * 

*4**a4*4 4- 4 -p * 

Que ry- 10095 AGGGGCAACTGGAG AAGTTGCTTTAACCGCAGGTGGCGGGTTGAGGC AATTGCCTGGATT 

PIR:T04448 255 GATGEVALTAGGGLRQLPGL 

■mm** a # 

»*«V*44<4 

Query- 10035 GACTGAGACCTATGAAAATGCTGCACTGGTATGCACTTATAAGATCTTCTTAAGCAATGA 



***#a*44*-** 



PIR:T04 448 275 TETYENAAL 

PIR:T04448 Exon 10115 10008 Confidence: 100 100 

Query- 9975 CAGTGAGTATTAGAAGGCAGATAGTTTACAAAAGCTCTGGGCCCTTGTAAATCTGCAGGT 

_ , — ^ 4— _ _, .. ^ _ |J _ 1X ^ lrll - _ _ _ _ _^ _ * 4 

* *■ 

PIR:T04448 284 V 

* 4 * W 4 * 

Query- 9915 TTGTGTACACTATGATGGAAAAATGTACGAGTTTGTTCCTTGGAATGGTGTTGTTAGATG 

PIR:T04448 285 CVHYDGKMYEFVPWNGVVRW 

| | | j [ | 

GSDB:S:495- 532 tagatg 

Query- 9855 GGAAATGTCTCCCTGGGG TTATTGGTATATAACTGCAGAGAACGAAAACCATGTGGTAA 

PIR:T04448 305 EMSPWG YWYITAENENHV 

I I II i l-l 1 M I I M ! i 1-1 I i I 1 f I M M I I t i ! I I M ! 1 I it I I t I M 1 I I I 

GSDB:S:495- 52 6 ggaaat tctccctgggggttattggtatataactgcagagaNcgNaaaccatgtg 
PIR:T04448 Exon 9917 9801 Confidence: 100 100 

GSDB:S:495- Exon 9861 9801 Confidence: 93 93 

Query- 979 6 ATTTGTTTTACTAGTTTCATTCAGTTTTACTTTTGACATCATATCATTCCCTTATGGCTA 
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PIR:T04448 



323 



GSDB:S:495- 471 

Query- 9736 GATTCCAACACCCGATGAATGTCTTGTGACAGGTGGAACTAGAGGCAAGAACAAATGAAG 

PIR:T04448 323 VELEARTNEA 

j |[ | | j || | i [ | j j | j | | | | [ i | | | | | 

GSDB: 5: 495- 471 gtggaactagaggcNagaacaaatgaag 

Query- 967 6 CGGGTACACCTCTGCGTGCTCCTACCACAGAAGTTGGGCTAGCTACGGCTTGCAGAGATA 



PIR:T04448 333 GTPLRAPTTEVGLATACRDS 

I i ! i 1 1 i I t I I ! i I I I ! i I I I 1 I I ! ! I i I i f I I I I ! M 1 I N I I i ( I 1 i ! I I 1 I I i I II i 
GSDB:S: 495- 443 cgggtacacctctgcgtgctcctaccacagaagttgggctagctacggcttgcagagara 

* 4 4 * * * 

Query- 9616 GTTGTTACGGTGAATTGAAGTTGCAGATATGGGAACGGCTATATGATGGAAGTAAAGGCA 

PIR:T04448 353 CYGELKLQIWERLYDGSKGK 

j i M i I i 1 i i II I M M I I i I ! ! 1 M i M I M S I i I i M i I I I ! i 1 I I 1 I i t M I 1 i I I I 
GSDB: S: 495- 383 gttgttacggtgaattgaagttgcagatatgggaacggctatatgatggaagtaaaggca 

Query- 9556 AGGTATGTATGCTAATGTGATCCAATCCCTGTAGTTAAAAGTCTTAACAAATCCTAAGGC 

r 4 _ ___ _^ _ rr _ , , ^ > _^ ^_ 4 4 * + 4 » 4 + *41 4 4*44 

PIR:T04448 373 LKVLTNPKA 

| | 

GSDB:S:495- 323 ag 

PIR:T04448 Exon 9704 9555 Confidence : 100 100 

GSDB:S:495- Exon 9704 9555 Confidence: 98 100 

Query- 9496 AGTGAAAGAAGATTATGAACGTTTGTTATGGTTAACAATGATGCAGGTGATATTAGAGAC 

PIR:T04448 382 VKEDYERLLWLTMMQVILET 

| 1 | | 1 M I I I II 1 1 

GSDB:S:495- 321 gtgatattagagac 

* * * » t 

Query- 943 6 AAAGAGCTCAATGGCAGCAGTGGAGATAGGAGGAGGACCGTGGTTTGGGACATGGAAAGG 

PIR:T04448 402 KS SMAAVE I G GGPWFGTWKG 

I M I I I I I II I I ! I I I I I I I I I M 1 M I I I I I ! ! 1 I II I ! I ! I i I ! ! M ! I I I M I I I I 
GSDB: S : 4 95- 307 aaagagctcaatggcaNcagtggagataggaggaggaccgtggtttgggacatggaaagg 

+ * • • > w 

fr***4*4«*-a* a 

Query- 937 6 A GAT AC GA GC AAC ACG C C C G AGC T AC T AA AAC AG GC T C T T C AGGT C C CAT T GG AT C T T G A 

PIR:T04 44 8 422 DT S NTPELLKQALQVPLDLE 

I I I M I I M I i I i II M M I I i I I I I M I M i I I I i i I I I M I M I I I i I I I I M I I I M 

GSDB : S : 495- 247 agatacgagcaacacgcccgagctactaaaacaggctcttcaggtcccattggatcttga 




Query- 9316 AAGCGCCTTAGGTTTGGTCCCTTTCTTCAAGCCACCGGGTCTGjEI 

(stop) 

PIR:T04448 442 SALGLVPFFKPPGL 

I H II I I I 1 M I I 1 I I I I I I I I I M f I I I I i I I I i I I f ! ! M II I M I I M I I M I I I I I 
GSDB: S : 495- 187 aagcgccttaggtttggtccctttcttcaagccaccgggtctgtaacattgatgagtgtt 

PIR:T04448 Exon 9522 9274 Confidence: 100 100 



Query- 9256 
PIR:T04448 456 

1 I I M I I ! I 1 I I I ! i I I ! I I I I M M I I i I II I i I I M II I I M I I I II I I i I I I ! M 1 I 
GSDB : S : 495- 127 ttgtttgttgatagagacccatgtgatgaatgaagccttagtcatgtcattgctagcttc 

» ■ * 4 » 4 

* 4 4 4 » * 4 4 ■ * 4 4 

Query- 9196 ACTATTATGTATGTATGATTTTAGTTCGTTCGGTCCTTGTGGTAAATGATACGGGCCAGT 

f M 1 II I M I I ! I I I ! I I I If I I I I I I i I II I II I ! 1 I I i M I I I I i i ! I I I li M I M i 
GSDB : S : 495- 67 actattatgtatgtatgattttagttcgttcggtccttgtggtaaatgatacgggccagt 

* * 4 4 m 

•**444»4*44# 

Query- 9136 GTAAAGTCTAGTTCAATAAAAGCCTTGAGTCGCATAATTTCAATTTCAAATTGCATC 

I 1 I I I I I 

GSDB:S:495- 7 graaagt 

GSDB:S:495- Exon 9450 9130 Confidence: 98 100 
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ATCSA4C37145 1 3063693/emb ! CAA18584 . 1 | 4.0e-43 (AL022537) putative protein 
[Arabidopsis thaliana] 

PIR-T04448 SPIR-T04448 shypothetical protein F4D11.30 - Arabidopsis thaliana; 
g3063693|embjCA7Ai8584.1 (AL022537) putative protein [Arabidopsis thaliana] _F4D11 . 30 

GSDB:S:4955486[A!995392!Al995392i701673779 A. thaliana, Columbia Col-0, inf lorescence- 
i Arabidopsis thaliana cDKA clone 701673779, mRNA sequence. 
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M 

MEIRSLIVSMNPNLSSFELSRPVSPLTRSLVPFRSTKLVPRSISRVSASI 



CFI_ARATH_P4 1 0 8 8_ 

slrl737_SYNSP_S74 814 
slr!737_ARATH_T04 4 4 8_ 
CFI_ARATH_P4108 8_ 

slrl7 37__SYNSP__S7 4 814 
slrl737_ARATH_T044 4 8" 
CFI_ARATH_P4108 8_ 

slrl737_SYNSP_S7 4814 
slr!7 37_ARATH_T04 4 48^ 
C F I_AR ATH__P 4 1 0 8 8_ 

s!rl7 37__SYNSP_S7 4 814 
slrl737_ARATH__T044 4 8 
CFI_ARATH__P4 1 0 8 8_ 

slr!737_SYNSP_S74814 
slrl737_ARATH_7044 4 8 
CFI_ARATH JP 4 1 0 8 8_ 

slrl737_SYNSP_S74 814 
slrl737_ARATH_T044 4 8 
CFI__ARATH_P4108 8_ 

slrl737_SYNSP_S74814 
slrl737_ARATH_T04 44 8 
CFI_ARATH_P4108 8_ 

slrl737_SYNSP_S7 4 814 
slrl7 37__ARATH_T044 4 8 
CFI_ARATH__P4 1 0 8 8_ 

slr!737_SYNSP_S74814 
slrl737_ARATH_T044 4 8 
CFI ARATH P41088 



KFP PHS GYHWQGQS - P FFEGW Y VRLL 

STPNSETDKISVKPVYVPTSPNRELRTPHSGYHFDGTPRKFFEGWYFRVS 



LPQSGESFAFMYSIENPASDHHYGGGAVQILGPATK KQENQEDQLV 

IPEKRESFCFMYSVENPAFRQSLSPLEVALYGPRFTGVGAQILGANDKYL 

MSSSNACASPSPFPA VTKLHVDSV- 

WRTFPSVKKFWASPRQFALG-HWGKCRDNRQ-A'KPLLSEEFFATVKEGYQ 
CQYEQDSHNFWGDRHELVTGNTFSAVPGAKAPNXEVPPEEFNRRVSEGFQ 
— TFVPSVKSPASSNPLF— G-GAGVRGLDIQ-GK FVIFTVIGVY 

IHQNQHQGQI IHGDR HCRWQFTVEPEVTWGSPNRFPRA7AGW 

ATPFWHQGHICDDGRTDYAETVKSARWEYSTRPVYGWGDVGAKQKS?AGW 
LEGNAVPSLSV KWKGKTTEELTES I PFFRE IV" GAF 

LSFLPLFDPGWQILLAQGRAHGWLKWQREQYEFDHALVYAEKNWGHSFPS 
PAAFPVFEPHWQICMAGGLSTGWIEWGGERFEFRDAPSYSEKNWGGGFPR 
EKFIKVT M KLPLTGQQYSEKVTENC 

RWFtfLQANYFPDHPG-LSVTAAGGERIVLGRPE EVALIGLHHQGNFY 

KWFWVQCNVFEGATGEVALTAGGGLRQLPGLTETYENAALVCVHYDGKMY 
VAIWKQLGLYTDCEA-KAV EKFLEIFKE ET 

EjGPGHGTVTWQVAJWGRWQLKASNDRYWVKLSGKTDKKGSLVHTP-TAQ 
E FV PWNGVVRWEMS P WGYWY I T AENENHVVELEARTNE AGT PLRAP T TEV 
-F??G-SSILFALS?TG SLTVAFSKDDS-IPETGIAVISNKLLAEA-VLE 

GLQLNCRDTTRGYLYLQLGSVGHG LIVQGETDTAGLEVGG 

GLATACRDSCYGELKLQIWERLYDGSKGKVILETKSSMAAVEIGGGPWFG 
--SIIGKNGVSPGTRLSVAERLSQ LMMKNKDEKEVSDHSL 

DWGLTEENLSKKT VPF 

TWKGDTSNTPELLKQALQVPLDLESALGLVPFFKPPGL 
EEKLAKEN 
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SEQUENCE LISTING 

<110> Lassner, M 
5 Post-Beittenmiller, D 

Savidge, B 
Weiss, J 

<120> Nucleic Acid Sequences Involved in 
10 Tocopherol Synthesis 

<130> 17133/02/US 

<150> 60/129,899 
%5 <151> 1999-04-15 

p mi 1 

■0 <150> 60/146, 461 
;y <151> 1999-07-30 

"w *\ 
* -1 -f 

SO <160> 94 

■ U¥U 

r <170> FastSEQ for Windows Version 4.0 

r <2io> l 

;l5 <211> 1182 
^ . <212> DNA 

<213> Arafoidopsis sp 



<400> 1 

30 atggagtctc tgctctctag ttcttctctt gtttccgctg ctggtgggtt ttgttggaag 60 

aagcagaatc taaagctcca ctctttatca gaaatccgag ttctgcgttg tgattcgagt 120 

aaagttgtcg caaaaccgaa gtttaggaac aatcttgtta ggcctgatgg tcaaggatct 180 

tcattgttgt tgtatccaaa acataagtcg agatttcggg ttaatgccac tgcgggtcag 240 

cctgaggctt tcgactcgaa tagcaaacag aagtctttta gagactcgtt agatgcgttt 300 

35 tacaggtttt ctaggcctca tacagttatt ggcacagtgc ttagcatttt atctgtatct 360 

ttcttagcag tagagaaggt ttctgatata tctcctttac ttttcactgg catcttggag 420 

gctgttgttg cagctctcat gatgaacatt tacatagttg ggctaaatca gttgtctgat 480 

gttgaaatag ataaggttaa caagccctat cttccattgg catcaggaga atattctgtt 540 

aacaccggca ttgcaatagt agcttccttc tccatcatga gtttctggct tgggtggatt 600 

4 0 gttggttcat ggccattgtt ctgggctctt tttgtgagtt tcatgctcgg tactgcatac 660 



1 
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t ctatcaatt 


tgccactttt 


acggtggaaa 


agatttgeat 


tggttgcagc 


aatgtgtatc 


720 


ctcgctgtcc 


gagctattat 


tgttcaaatc 


gecttttate 


tacatattca 


gacacatgtq 


780 


t ttggaagac 


caatcttgtt 


cactaggcct 


cttattttcg 


ccactgcgtt 


tatgagcttt 


840 


tt ctctgt eg 


ttattgeatt 


gtttaaggat 


atacctgata 


tcgaagggga 


taagatattc 


900 


ggaatccgat 


cattctctgt 


aactctgggt 


cagaaaeggg 


tgttttggac 


atgtgttaca 


960 


ctacttcaaa 


tgqcttacqc 


tqttcfcaatt 


ctacrtt acraa 


ccaratnt rr 




1UZU 


agcaaagtca 


tctcggttgt 


gggtcatgtt 


atactegcaa 


caactttgtg 


ggctcgagct 


1080 


aagtccgttg 


atctgagtag 


caaaaccgaa 


ataacttcat 


gttatatgtt 


catatggaag 


1140 


ctcttttatg 


cagagtactt 


gctgttacct 


tttttgaagt 


ga 




1182 



<210> 2 
<211> 393 
<212> PRT 

<213> Arabidopsis sp 



<400> 2 

Met Glu Ser Leu Leu Ser Ser Ser Ser Leu Val Ser Ala Ala Gly Gly 

1 5 10 15 

Phe Cys Trp Lys Lys Gin Asn Leu Lys Leu His Ser Leu Ser Glu He 

20 25 30 

Arg Val Leu Arg Cys Asp Ser Ser Lys Val Val Ala Lys Pro Lys Phe 

35 40 45 

Arg Asn Asn Leu Val Arg Pro Asp Gly Gin Gly Ser Ser Leu Leu Leu 

50 55 60 

Tyr Pro Lys His Lys Ser Arg Phe Arg Val Asn Ala Thr Ala Gly Gin 
65 70 75 80 

Pro Glu Ala Phe Asp Ser Asn Ser Lys Gin Lys Ser Phe Arg Asp Ser 

85 90 95 

Leu Asp Ala Phe Tyr Arg Phe Ser Arg Pro His Thr Val He Gly Thr 

100 105 ' 110 

Val Leu Ser He Leu Ser Val Ser Phe Leu Ala Val Glu Lys Val Ser 

115 120 125 

Asp He Ser Pro Leu Leu Phe Thr Gly He Leu Glu Ala Val Val Ala 

130 135 140 

Ala Leu Met Met Asn He Tyr He Val Gly Leu Asn Gin Leu Ser Asp 
145 150 155 160 

Val Glu He Asp Lys Val Asn Lys Pro Tyr Leu Pro Leu Ala Ser Gly 

165 170 175 

Glu Tyr Ser Val Asn Thr Gly He Ala He Val Ala Ser Phe Ser He 

180 185 190 

Met Ser Phe Trp Leu Gly Trp He Val Gly Ser Trp Pro Leu Phe Trp 



2 
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195 

Ala Leu Phe Val 
210 

Pro Leu Leu Arg 
225 

Leu Ala Val Arg 

Gin Thr His Val 

260 

Phe Ala Thr Ala 
275 

Lys Asp lie Pro 
290 

Phe Ser Val Thr 
305 

Leu Leu Gin Met 

Pro Phe lie Trp 

340 

Ala Thr Thr Leu 

355 

Thr Glu He Thr 
370 

Glu Tyr Leu Leu 
385 



200 

Ser Phe Met Leu 
215 

Trp Lys Arg Phe 
230 

Ala He He Val 
245 

Phe Gly Arg Pro 

Phe Met Ser Phe 

280 

Asp He Glu Gly 

295 

Leu Gly Gin Lys 
310 

Ala Tyr Ala Val 
325 

Ser Lys Val He 

Trp Ala Arg Ala 

360 

Ser Cys Tyr Met 
375 

Leu Pro Phe Leu 
390 



Gly Thr Ala Tyr 

220 

Ala Leu Val Ala 

235 

Gin He Ala Phe 
250 

He Leu Phe Thr 
265 

Phe Ser Val Val 

Asp Lys He Phe 

300 

Arg Val Phe Trp 

315 

Ala He Leu Val 
330 

Ser Val Val Gly 
345 

Lys Ser Val Asp 

Phe He Trp Lys 

380 

Lys 



205 

Ser He Asn Leu 

Ala Met Cys He 

240 

Tyr Leu His He 
255 

Arg Pro Leu He 
270 

He Ala Leu Phe 
285 

Gly lie Arg Ser 

Thr Cys Val Thr 

320 

Gly Ala Thr Ser 
335 

His Val He Leu 
350 

Leu Ser Ser Lys 
365 

Leu Phe Tyr Ala 



<210> 3 
<211> 1224 
<212> DNA 

<213> Arabidopsis sp 



<400> 3 














atggcgtttt 


ttgggctctc 


ccgtgtttca 


agacggttgt 


tgaaatcttc 


cgtctccgta 


60 


actccatctt 


cttcctctgc 


tcttttgcaa 


tcacaacata 


aatccttgtc 


caatcctgtg 


120 


actacccatt 


acacaaatcc 


tttcactaag 


tgttatcctt 


catggaatga 


taattaccaa 


180 


gtatggagta 


aaggaagaga 


attgcatcag 


gagaagtttt 


ttggtgttgg 


ttggaattac 


240 


agattaattt 


gtggaatgtc 


gtcgtcttct 


tcggttttgg 


agggaaagcc 


gaagaaagat 


300 


gataaggaga 


agagtgatgg 


tgttgttgtt 


aagaaagctt 


cttggataga 


tttgtattta 


360 


ccagaagaag 


ttagaggtta 


tgctaagctt 


gctcgattgg 


ataaacccat 


tggaacttgg 


420 


ttgcttgcgt 


ggccttgtat 


gtggtcgatt 


gcgttggctg 


ctgatcctgg 


aagccttcca 


480 


agttttaaat 


atatggcttt 


atttggttgc 


ggagcattac 


ttcttagagg 


tgctggttgt 


540 
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actataaatg atctgcttga tcaggacata gatacaaagg ttgatcgtac aaaactaaga 600 

cctatcgcca gtggtctttt gacaccattt caagggattg gatttctcgg gctgcagttg 660 

cttttaggct tagggattct tctccaactt aacaattaca gccgtgtttt aggggcttca 720 

tctttgttac ttgtcttttc ctacccactt atgaagaggt ttacattttg gcctcaagcc 780 

tttttaggtt tgaccataaa ctggggagca ttgttaggat ggactgcagt taaaggaagc 840 

atagcaccat ctattgtact ccctctctat ctctccggag tctgctggac ccttgtttat 900 

gatactattt atgcacatca ggacaaagaa gatgatgtaa aagttggtgt taagtcaaca 960 

gcccttagat tcggtgataa tacaaagctt tggttaactg gatttggcac agcatccata 1020 

ggttttcttg cactttctgg attcagtgca gatctcgggt ggcaatatta cgcatcactg 1080 

gccgctgcat caggacagtt aggatggcaa atagggacag ctgacttatc atctggtgct 1140 

gactgcagta gaaaatttgt gtcgaacaag tggtttggtg ctattatatt tagtggagtt 1200 

gtacttggaa gaagttttca ataa 1224 



<210> 4 
<211> 407 
<212> PRT 

<213> Arabidopsis sp 
<400> 4 

Met Ala Phe Phe Gly Leu Ser Arg Val Ser Arg Arg Leu Leu Lys Ser 

15 10 15 

Ser Val Ser Val Thr Pro Ser Ser Ser Ser Ala Leu Leu Gin Ser Gin 

20 25 30 

His Lys Ser Leu Ser Asn Pro Val Thr Thr His Tyr Thr Asn Pro Phe 

35 40 45 

Thr Lys Cys Tyr Pro Ser Trp Asn Asp Asn Tyr Gin Val Trp Ser Lys 

50 55 60 

Gly Arg Glu Leu His Gin Glu Lys Phe Phe Gly Val Gly Trp Asn Tyr 
65 70 75 80 

Arg Leu lie Cys Gly Met Ser Ser Ser Ser Ser Val Leu Glu Gly Lys 

85 90 95 

Pro Lys Lys Asp Asp Lys Glu Lys Ser Asp Gly Val Val Val Lys Lys 

100 105 110 

Ala Ser Trp lie Asp Leu Tyr Leu Pro Glu Glu Val Arg Gly Tyr Ala 

115 120 125 

Lys Leu Ala Arg Leu Asp Lys Pro lie Gly Thr Trp Leu Leu Ala Trp 

130 135 140 

Pro Cys Met Trp Ser He Ala Leu Ala Ala Asp Pro Gly Ser Leu Pro 
145 150 155 160 

Ser Phe Lys Tyr Met Ala Leu Phe Gly Cys Gly Ala Leu Leu Leu Arg 

165 170 175 



Attorney Docket No: 

52015/01AJS 

Gly Ala Gly Cys Thr lie Asn Asp Leu Leu Asp Gin Asp lie Asp Thr 

180 185 190 

Lys Val Asp Arg Thr Lys Leu Arg Pro lie Ala Ser Gly Leu Leu Thr 

195 200 205 

Pro Phe Gin Gly lie Gly Phe Leu Gly Leu Gin Leu Leu Leu Gly Leu 

210 215 220 

Gly lie Leu Leu Gin Leu Asn Asn Tyr Ser Arg Val Leu Gly Ala Ser 
225 230 235 240 

Ser Leu Leu Leu Val Phe Ser Tyr Pro Leu Met Lys Arg Phe Thr Phe 

245 250 255 

Trp Pro Gin Ala Phe Leu Gly Leu Thr He Asn Trp Gly Ala Leu Leu 

260 265 270 

Gly Trp Thr Ala Val Lys Gly Ser He Ala Pro Ser He Val Leu Pro 

275 280 285 

Leu Tyr Leu Ser Gly Val Cys Trp Thr Leu Val Tyr Asp Thr He Tyr 

290 295 300 

Ala His Gin Asp Lys Glu Asp Asp Val Lys Val Gly Val Lys Ser Thr 
305 310 315 320 

Ala Leu Arg Phe Gly Asp Asn Thr Lys Leu Trp Leu Thr Gly Phe Gly 

325 330 335 

Thr Ala Ser He Gly Phe Leu Ala Leu Ser Gly Phe Ser Ala Asp Leu 

340 345 350 

Gly Trp Gin Tyr Tyr Ala Ser Leu Ala Ala Ala Ser Gly Gin Leu Gly 

355 360 365 

Trp Gin He Gly Thr Ala Asp Leu Ser Ser Gly Ala Asp Cys Ser Arg 

370 375 380 

Lys Phe Val Ser Asn Lys Trp Phe Gly Ala He He Phe Ser Gly Val 
385 390 395 400 

Val Leu Gly Arg Ser Phe Gin 

405 

<210> 5 
<211> 1296 
<212> DNA 

<213> Arabidopsis sp 
<400> 5 

atgtggcgaa gatctgttgt ttctcgttta tcttcaagaa tctctgtttc ttcttcgtta 60 
ccaaacccta gactgattcc ttggtcccgc gaattatgtg ccgttaatag cttctcccag 120 
cctccggtct cgacggaatc aactgctaag ttagggatca ctggtgttag atctgatgcc 180 
aatcgagttt ttgccactgc tactgccgcc gctacagcta cagctaccac cggtgagatt 240 
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tcgtctagag ttgcggcttt ggctggatta gggcatcact acgctcgttg ttattgggag 300 

ctttctaaag ctaaacttag tatgcttgtg gttgcaactt ctggaactgg gtatattctg 360 

ggtacgggaa atgctgcaat tagcttcccg gggctttgtt acacatgtgc aggaaccatg 420 

atgattgctg catctgctaa ttccttgaat cagatttttg agataagcaa tgattctaag 480 

5 atgaaaagaa cgatgctaag gccattgcct tcaggacgta ttagtgttcc acacgctgtt 540 

gcatgggcta ctattgctgg tgcttctggt gcttgtttgt tggccagcaa gactaatatg 600 

ttggctgctg gacttgcatc tgccaatctt gtactttatg cgtttgttta tactccgttg 660 

aagcaacttc accctatcaa tacatgggtt ggcgctgttg ttggtgctat cccacccttg 720 

cttgggtggg cggcagcgtc tggtcagatt tcatacaatt cgatgattct tccagctgct 780 

10 ctttactttt ggcagatacc tcattttatg gcccttgcac atctctgccg caatgattat 840 

gcagctggag gttacaagat gttgtcactc tttgatccgt cagggaagag aatagcagca 900 

gtggctctaa ggaactgctt ttacatgatc cctctcggtt tcatcgccta tgactggggg 960 

ttaacctcaa gttggttttg cctcgaatca acacttctca cactagcaat cgctgcaaca 1020 

gcattttcat tctaccgaga ccggaccatg cataaagcaa ggaaaatgtt ccatgccagt 1080 

35 cttctcttcc ttcctgtttt catgtctggt cttcttctac accgtgtctc taatgataat 1140 

■ lb- hi 

* *T 

jjf cagcaacaac tcgtagaaga agccggatta acaaattctg tatctggtga agtcaaaact 1200 

^ cagaggcgaa agaaacgtgt ggctcaacct ccggtggctt atgcctctgc tgcaccgttt 1260 

':g cctttcctcc cagctccttc cttctactct ccatga 1296 



: B0 <21C> 6 
^ <211> 431 
U <212> PRT 

<213> Arabidopsis sp 

H F 

:25 <400> 6 

Q Met Trp Arg Arg Ser Val Val Tyr Arg Phe Ser Ser Arg lie Ser Val 
15 10 15 

Ser Ser Ser Leu Pro Asn Pro Arg Leu lie Pro Trp Ser Arg Glu Leu 

20 25 30 

30 Cys Ala Val Asn Ser Phe Ser Gin Pro Pro Val Ser Thr Glu Ser Thr 

35 40 45 

Ala Lys Leu Gly lie Thr Gly Val Arg Ser Asp Ala Asn Arg Val Phe 

50 55 60 

Ala Thr Ala Thr Ala Ala Ala Thr Ala Thr Ala Thr Thr Gly Glu lie 
35 65 70 75 80 

Ser Ser Arg Val Ala Ala Leu Ala Gly Leu Gly His His Tyr Ala Arg 

85 90 95 

Cys Tyr Trp Glu Leu Ser Lys Ala Lys Leu Ser Met Leu Val Val Ala 

100 105 110 

4 0 Thr Ser Gly Thr Gly Tyr He Leu Gly Thr Gly Asn Ala Ala He Ser 

115 120 125 
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Phe Pro Gly Leu 
130 

Ser Ala Asn Ser 
145 

Met Lys Arg Thr 

Pro His Ala Val 

180 

Leu Leu Ala Ser 
195 

Asn Leu Val Leu 
210 

Pro lie Asn Thr 
225 

Leu Gly Trp Ala 

Leu Pro Ala Ala 

260 

Ala His Leu Cys 
275 

Ser Leu Phe Asp 
290 

Asn Cys Phe Tyr 
305 

Leu Thr Ser Ser 

lie Ala Ala Thr 

340 

Ala Arg Lys Met 
355 

Ser Gly Leu Leu 
370 

Val Glu Glu Ala 
385 

Gin Arg Arg Lys 

Ala Ala Pro Phe 

420 



Cys Tyr Thr Cys 
135 

Leu Asn Gin lie 
150 

Met Leu Arg Pro 
165 

Ala Trp Ala Thr 

Lys Thr Asn Met 

200 

Tyr Ala Phe Val 
215 

Trp Val Gly Ala 
230 

Ala Ala Ser Gly 
245 

Leu Tyr Phe Trp 

Arg Asn Asp Tyr 

280 

Pro Ser Gly Lys 
295 

Met lie Pro Leu 
310 

Trp Phe Cys Leu 
325 

Ala Phe Ser Phe 

Phe His Ala Ser 

360 

Leu His Arg Val 
375 

Gly Leu Thr Asn 
390 

Lys Arg Val Ala 
405 

Pro Phe Leu Pro 



Ala Gly Thr Met 

140 

Phe Glu lie Ser 
155 

Leu Pro Ser Gly 
170 

lie Ala Gly Ala 
185 

Leu Ala Ala Gly 

Tyr Thr Pro Leu 

220 

Val Val Gly Ala 
235 

Gin lie Ser Tyr 
250 

Gin lie Pro His 
265 

Ala Ala Gly Gly 

Arg lie Ala Ala 

300 

Gly Phe He Ala 
315 

Glu Ser Thr Leu 
330 

Tyr Arg Asp Arg 
345 

Leu Leu Phe Leu 

Ser Asn Asp Asn 

380 

Ser Val Ser Gly 
395 

Gin Pro Pro Val 
410 

Ala Pro Ser Phe 
425 



Met He Ala Ala 

Asn Asp Ser Lys 

160 

Arg He Ser Val 

175 

Ser Gly Ala Cys 
190 

Leu Ala Ser Ala 
205 

Lys Gin Leu His 

He Pro Pro Leu 

240 

Asn Ser Met He 

255 

Phe Met Ala Leu 
270 

Tyr Lys Met Leu 
285 

Val Ala Leu Arg 

Tyr Asp Trp Gly 

320 

Leu Thr Leu Ala 
335 

Thr Met His Lys 
350 

Pro Val Phe Met 

365 , 

Gin Gin Gin Leu 

Glu Val Lys Thr 

400 

Ala Tyr Ala Ser 

415 

Tyr Ser Pro 
430 



<210> 7 
<211> 479 
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<212> DNA 



<213> Arabidopsis sp 



<400> 7 



ggaaactccc ggagcacctg tttgcaggta 
ttgtcaggaa ttatgtaagt ctggtggaag 
ctatgatcgg gtttactttg ggtgtgatga 
acccattttt atccaatgct gcattccctc 
gtatcatgct gataaaactg ttactggtac 
cgatggcgtt taaccggtat ctcgacaggc 
tccgtgaaat acctgcgggc gtcatatctg 
gctgcgtggt attctgggtg gcctgttatt 



ccgctaacct taatcgataa tttatttctc 
gctcgcatac catttttgca ttgcctttcg 
gaccaggcgt ggctttatgg tatggcgaaa 
ccgatgattc gttctttcat tcctatacag 
tggtttgtat ggtatcagca agaagcgcgg 
attttgacgc gaagaacccg cgtactgcca 
ccaacagtgc gctggtgttt acgataggct 
tcattaacac gatctgtttt tacctggcg 



479 



420 



360 



240 



300 



180 



120 



60 



<210> 8 
<211> 551 
<212> DNA 

<213> Arabidopsis sp 
<220> 

<221> misc_f eature 
<222> (1) . . . (551) 
<223> n = A,T,C or G 



<400> 8 














ttgtggctta 


caccttaatg 


agcatacgcc 


agnccattac 


ggctcgttaa 


tcggcgccat 


60 


ngccggngct 


gntgcaccgg 


tagtgqgcta 


ctgcgccgtg 


accaatcagc 


ttgatctagc 


120 


ggctcttatt 


ctgtttttaa 


ttttactgtt 


ctggcaaatg 


ccgcattttt 


acgcgatttc 


180 


cattttcagg 


ctaaaagact 


tttcagcggc 


ctgtattccg 


gtgctgccca 


tcattaaaga 


240 


cctgcgctat 


accaaaatca 


gcatgctggt 


ttacgtgggc 


ttatttacac 


tggctgctat 


300 


catgccggcc 


ct cttagggt 


atgccggttg 


gatttatggg 


atagcggcct 


taattttagg 


360 


cttgtattgg 


ctttatattg 


ccatacaagg 


attcaagacc 


gccgatgatc 


aaaaatggtc 


420 


tcgtaagatg 


tttggatctt 


cgattttaat 


cattaccctc 


ttgtcggtaa 


tgatgcttgt 


480 


ttaaacttac 


tgcctcctga 


agtttatata 


tcgataattt 


cagcttaagg 


aggcttagtg 


540 


gttaattcaa 


t 










551 



<210> 9 
<211> 297 
<212> PRT 

<213> Arabidopsis sp 
<400> 9 
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Met Val Leu Ala Glu Val Pro Lys Leu Ala Ser Ala Ala Glu Tyr Phe 

15 10 15 

Phe Lys Arg Gly Val Gin Gly Lys Gin Phe Arg Ser Thr He Leu Leu 

20 25 30 

5 Leu Met Ala Thr Ala Leu Asn Val Arg Val Pro Glu Ala Leu He Gly 

35 40 45 

Glu Ser Thr Asp He Val Thr Ser Glu Leu Arg Val Arg Gin Arg Gly 

50 55 60 

He Ala Glu He Thr Glu Met He His Val Ala Ser Leu Leu His Asp 
10 65 70 75 80 

Asp Val Leu Asp Asp Ala Asp Thr Arg Arg Gly Val Gly Ser Leu Asn 

85 90 95 

Val Val Met Gly Asn Lys Val Val Ala Leu Leu Ala Thr Ala Val Glu 

100 105 110 

;J5 His Leu Val Thr Gly Glu Thr Met Glu He Thr Ser Ser Thr Glu Gin 
B 115 120 125 

^ Arg Tyr Ser Met Asp Tyr Tyr Met Gin Lys Thr Tyr Tyr Lys Thr Ala 
;Q 130 135 140 

□ Ser Leu lie Ser Asn Ser Cys Lys Ala Val Ala Val Leu Thr Gly Gin 

l0 145 150 155 160 

^ Thr Ala Glu Val Ala Val Leu Ala Phe Glu Tyr Gly Arg Asn Leu Gly 

165 170 175 

;3 Leu Ala Phe Gin Leu He Asp Asp He Leu Asp Phe Thr Gly Thr Ser 

Z 180 185 190 

;g5 Ala Ser Leu Gly Lys Gly Ser Leu Ser Asp He Arg His Gly Val He 
; 5 195 200 205 

Thr Ala Pro He Leu Phe Ala Met Glu Glu Phe Pro Gin Leu Arg Glu 

210 215 220 

Val Val Asp Gin Val Glu Lys Asp Pro Arg Asn Val Asp He Ala Leu 
30 225 230 235 240 

Glu Tyr Leu Gly Lys Ser Lys Gly He Gin Arg Ala Arg Glu Leu Ala 

245 250 255 

Met Glu His Ala Asn Leu Ala Ala Ala Ala He Gly Ser Leu Pro Glu 

260 265 270 

35 Thr Asp Asn Glu Asp Val Lys Arg Ser Arg Arg Ala Leu He Asp Leu 

275 280 285 

Thr His Arg Val He Thr Arg Asn Lys 
290 295 

40 <210> 10 
<211> 561 
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<212> DNA 

<213> Arabidopsis sp 
<400> 10 

aagcgcatcc gtcctcttct acgattgccg ccagccgcat gtatggctgc ataaccgacc 60 

gcccctatcc gctcgcggcc gcggtcgaat tcattcacac cgcgacgctg ctgcatgacg 120 

acgtcgtcga tgaaagcgat ttgcgccgcg gccgcgaaag cgcgcataag gttttcggca 180 

atcaggcgag cgtgctcgtc ggcgatttcc ttttctcccg cgccttccag ctgatggtgg 240 

aagacggctc gctcgacgcg ctgcgcattc tctcggatgc ctccgccgtg atcgcgcagg 300 

gcgaagtgat gcagctcggc accgcgcgca atcttgaaac caatatgagc cagtatctcg 360 

atgtgatcag cgcgaagacc gccgcgctct ttgccgccgc ctgcgaaatc ggcccggtga 420 

tggcgaacgc gaaggcggaa gatgctgccg cgatgtgcga atacggcatg aatctcggta 480 

tcgccttcca gatcatcgac gaccttctcg attacggcac cggcggccac gccgagcttg 540 

gcaagaacac gggcgacgat t 5^ 

<210> 11 
<211> 966 
<212> DNA 

<213> Arabidopsis sp 
<400> 11 

atggtacttg ccgaggttcc aaagcttgcc tctgctgctg agtacttctt caaaaggggt 60 
gtgcaaggaa aacagtttcg ttcaactatt ttgctgctga tggcgacagc tctgaatgta 120 
cgcgttccag aagcattgat tggggaatca acagatatag tcacatcaga attacgcgta 180 
aggcaacggg gtattgctga aatcactgaa atgatacacg tcgcaagtct actgcacgat 240 
gatgtcttgg atgatgccga tacaaggcgt ggtgttggtt ccttaaatgt tgtaatgggt 300 
aacaagatgt cggtattagc aggagacttc ttgctctccc gggcttgtgg ggctctcgct 360 
gctttaaaga acacagaggt tgtagcatta cttgcaactg ctgtagaaca tcttgttacc 420 
ggtgaaacca tggaaataac tagttcaacc gagcagcgtt atagtatgga ctactacatg 480 
cagaagacat attataagac agcatcgcta atctctaaca gctgcaaagc tgttgccgtt 540 
ctcactggac aaacagcaga agttgccgtg ttagcttttg agtatgggag gaatctgggt 600 
ttagcattcc aattaataga cgacattctt gatttcacgg gcacatctgc ctctctcgga 660 
aagggatcgt tgtcagatat tcgccatgga gtcataacag ccccaatcct ctttgccatg 720 
gaagagtttc ctcaactacg cgaagttgtt gatcaagttg aaaaagatcc taggaatgtt 
gacattgctt tagagtatct tgggaagagc aagggaatac agagggcaag agaattagcc 
atggaacatg cgaatctagc agcagctgca atcgggtctc tacctgaaac agacaatgaa 
gatgtcaaaa gatcgaggcg ggcacttatt gacttgaccc atagagtcat caccagaaac 
aagtga 



780 
840 
900 
960 
966 



<210> 12 
<211> 321 
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<212> PRT 

<213> Arabidopsis sp 



<400> 12 

Met Val Leu Ala Glu Val Pro Lys Leu Ala Ser Ala Ala Glu Tyr Phe 

15 10 15 

Phe Lys Arg Gly Val Gin Gly Lys Gin Phe Arg Ser Thr lie Leu Leu 

20 25 30 

Leu Met Ala Thr Ala Leu Asn Val Arg Val Pro Glu Ala Leu lie Gly 

35 40 45 

Glu Ser Thr Asp He Val Thr Ser Glu Leu Arg Val Arg Gin Arg Gly 

50 55 60 

He Ala Glu He Thr Glu Met He His Val Ala Ser Leu Leu His Asp 
65 70 75 80 

Asp Val Leu Asp Asp Ala Asp Thr Arg Arg Gly Val Gly Ser Leu Asn 

85 90 95 

Val Val Met Gly Asn Lys Met Ser Val Leu Ala Gly Asp Phe Leu Leu 

100 105 110 

Ser Arg Ala Cys Gly Ala Leu Ala Ala Leu Lys Asn Thr Glu Val Val 

115 120 125 

Ala Leu Leu Ala Thr Ala Val Glu His Leu Val Thr Gly Glu Thr Met 

130 135 140 

Glu He Thr Ser Ser Thr Glu Gin Arg Tyr Ser Met Asp Tyr Tyr Met 
145 150 155 160 

Gin Lys Thr Tyr Tyr Lys Thr Ala Ser Leu He Ser Asn Ser Cys Lys 

165 170 175 

Ala Val Ala Val Leu Thr Gly Gin Thr Ala Glu Val Ala Val Leu Ala 

180 185 190 

Phe Glu Tyr Gly Arg Asn Leu Gly Leu Ala Phe Gin Leu He Asp Asp 

195 200 205 

He Leu Asp Phe Thr Gly Thr Ser Ala Ser Leu Gly Lys Gly Ser Leu 

210 215 220 

Ser Asp He Arg His Gly Val He Thr Ala Pro He Leu Phe Ala Met 
225 230 235 240 

Glu Glu Phe Pro Gin Leu Arg Glu Val Val Asp Gin Val Glu Lys Asp 

245 250 255 

Pro Arg Asn Val Asp He Ala Leu Glu Tyr Leu Gly Lys Ser Lys Gly 

260 265 270 

He Gin Arg Ala Arg Glu Leu Ala Met Glu His Ala Asn Leu Ala Ala 

275 280 285 

Ala Ala He Gly Ser Leu Pro Glu Thr Asp Asn Glu Asp Val Lys Arg 
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290 295 300 

Ser Arg Arg Ala Leu lie Asp Leu Thr His Arg Val lie Thr Arg Asn 
305 310 315 320 

Lys 

<210> 13 
<211> 621 
<212> DNA 

<213> Arabidopsis sp 



<400> 13 

gctttctcct ttgctaattc ttgagctttc ttgatcccac cgcgatttct aactatttca 60 

atcgcttctt caagcgatcc aggctcacaa aactcagact caatgatctc tcttagcctt 120 

ggctcattct ctagcgcgaa gatcactggc gccgttatgt tacctttggc taagtcatta 180 

gctgcaggct tacctaactg ctctgtggac tgagtgaagt ccagaatgtc atcaactact 240 

tgaaaagata aaccgagatt cttcccgaac tgatacattt gctctgcgac cttgctttcg 300 

actttactga aaattgctgc tcctttggtg cttgcagcta ctaatgaagc tgtcttgtag 360 

taactcttta gcatgtagtc atcaagcttg acatcacaat cgaataaact cgatgcttgc 420 

tttatctcac cgcttgcaaa atctttgatc acctgcaaaa agataaatca agattcagac 480 

caaatgttct ttgtattgag tagcttcatc taatctcaga aaggaatatt acctgactta 540 

tgagcttaat gacttcaagg ttttcgagat ttgtaagtac catgatgctt gagcaacatg 600 

aaatccccag ctaatacagc t 521 



<210> 14 
<211> 741 
<212> DNA 

<213> Arabidopsis sp 



<400> 14 

ggtgagtttt gttaatagtt atgagattca tctatttttg tcataaaatt gtttggtttg 60 

gtttaaactc tgtgtataat tgcaggaaag gaaacagttc atgagctttt cggcacaaga 120 

gtagcggtgc tagctggaga tttcatgttt gctcaagcgt catggtactt agcaaatctc 180 

gagaatcttg aagttattaa gctcatcagt caggtactta gttactctta cattgttttt 240 

ctatgaggtt gagctatgaa tctcatttcg ttgaataatg ctgtgcctca aacttttttt 300 

catgttttca ggtgatcaaa gactttgcaa gcggagagat aaagcaggcg tccagcttat 360 

ttgactgcga caccaagctc gacgagtact tactcaaaag tttctacaag acagcctctt 420 

tagtggctgc gagcaccaaa ggagctgcca ttttcagcag agttgagcct gatgtgacag 480 

aacaaatgta cgagtttggg aagaatctcg gtctctcttt ccagatagtt gatgatattt 540 

tggatttcac tcagtcgaca gagcagctcg ggaagccagc agggagtgat ttggctaaag 600 

gtaacttaac agcacctgtg attttcgctc tggagaggga gccaaggcta agagagatca 660 
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ttgagtcaaa gttctgtgag gcgggttctc tggaagaagc gattgaagcg gtgacaaaag 720 
gtggggggat taagagagca c 741 



<210> 15 
<211> 1087 
<212> DNA 

<213> Arabidopsis sp 



<400> 15 

cctcttcagc caatccagag gaagaagaga caacttttta tctttcgtca agagtctccg 60 

aaaacgcacg gttttatgct ctctcttctg ccctcacctc acaagacgca gggcacatga 120 

ttcaaccaga gggaaaaagc aacgataaca actctgcttt tgatttcaag ctgtatatga 180 

tccgcaaagc cgagtctgta aatgcggctc tcgacgtttc cgtaccgctt ctgaaacccc 240 

ttacgatcca agaagcggtc aggtactctt tgctagccgg cggaaaacgt gtgaggcctc 300 

tgctctgcat tgccgcttgt gagcttgtgg ggggcgacga ggctactgcc atgtcagccg 360 

cttgcgcggt cgagatgatc cacacaagct ctctcattca tgacgatctt ccgtgcatgg 420 

acaatgccga cctccgtaga ggcaagccca ccaatcacaa ggtatgttgt ttaattatat 480 

gaaggctcag agataatgct gaactagtgt tgaaccaatt tttgctcaaa caaggtatat 540 

ggagaagaca tggcggtttt ggcaggtgat gcactccttg cattggcgtt tgagcacatg 600 

acggttgtgt cgagtgggtt ggtcgctccc gagaagatga ttcgcgccgt ggttgagctg 660 

gccagggcca tagggactac agggctagtt gctggacaaa tgatagacct agccagcgaa 720 

agactgaatc cagacaaggt tggattggag catctagagt tcatccatct ccacaaaacg 780 

gcggcattgt tggaggcagc ggcagtttta ggggttataa tgggaggtgg aacagaggaa 840 

gaaatcgaaa agcttagaaa gtatgctagg tgtattggac tactgtttca ggttgttgat 900 

gacattctcg acgtaacaaa atctactgag gaattgggta agacagccgg aaaagacgta 960 

atggccggaa agctgacgta tccaaggctg ataggtttgg agggatccag ggaagttgca 1020 

gagcacctga ggagagaagc agaggaaaag cttaaagggt ttgatccaag tcaggcggcg 1080 

cctctgg 1087 



<210> 16 
<211> 1164 
<212> DNA 

<213> Arabidopsis sp 



<400> 16 

atgacttcga ttctcaacac tgtctccacc atccactctt ccagagttac ctccgtcgat 60 

cgagtcggag tcctctctct tcggaattcg gattccgttg agttcactcg ccggcgttct 120 

ggtttctcga cgttgatcta cgaatcaccc gggcggagat ttgttgtgcg tgcggcggag 180 

actgatactg ataaagttaa atctcagaca cctgacaagg caccagccgg tggttcaagc 240 

attaaccagc ttctcggtat caaaggagca tctcaagaaa ctaataaatg gaagattcgt 300 

cttcagctta caaaaccagt cacttggcct ccactggttt ggggagtcgt ctgtggtgct 360 
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10 



15 



20 



15 



30 



35 



40 



J _ J 1 

gctgcttcag 


ggaactttca 


ttggacccca 


gaggatgttg 


ctaagtcgat 


tctttgcatg 


420 


atgatgtctg 


i i r i i 

gtccttgtct 


j r ii 

tactggctat 


acacagacaa 


tcaacgactg 


gtatgataga 


480 


gatatcgacg 


caattaatga 


gccatatcgt 


ccaattccat 


ctggagcaat 


atcagagcca 


540 


1 ! 11 

gaggttatta 


cacaagtctg 


ggtgctatta 


ttgggaggtc 


ttggtattgc 


tggaatatta 


f\ S~\ 

600 


i J 1 

gatgtgtggg 


cagggcatac 


cactcccact 


gtcttctatc 


ttgctttggg 


aggatcattg 


660 


j j j i . j 

c-tatcttata 


tatactctgc 


t ccacctctt 


aagctaaaac 


aaaatggatg 


ggttggaaat 


n a ry 

720 


tttgcacttg 


gagcaagcta 


tattagtttg 


ccat ggtggg 


ctggccaagc 


attgtttggc 


*~i o r\ 

7 80 


acrcttacgc 


cagatgttgt 


tgttctaaca 


ctcttgtaca 


gcatagctgg 


gttaggaata 


84 0 


gccattgtta 


acgacttcaa 


aagtgttgaa 


ggagatagag 


cattaggact 


tcagtctctc 


AAA 

900 


ccagtagctt 


ttggcaccga 


aactgcaaaa 


tggatatgcg 


ttggtgctat 


agacattact 


960 


cagctttctg 


ttgccggata 


tctattagca 


tctgggaaac 


cttattatgc 


gttggcgttg 


1020 


gttgctttga 


tcattcctca 


gattgtgttc 


cagtttaaat 


actttctcaa 


ggaccctgtc 


1080 


aaatacgacg 


t caagtacca 


ggcaagcgcg 


cagccattct 


tggtgctcgg 


aatatttgta 


1140 


acggcattag 


catcgcaaca 


ctga 
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<210> 17 
<211> 387 
<212> PRT 

<213> Arabidopsis sp 



<400> 17 

Met Thr Ser lie Leu Asn Thr Val Ser Thr lie His Ser Ser Arg Val 

15 10 15 

Thr Ser Val Asp Arg Val Gly Val Leu Ser Leu Arg Asn Ser Asp Ser 

20 25 30 

Val Glu Phe Thr Arg Arg Arg Ser Gly Phe Ser Thr Leu lie Tyr Glu 

35 40 45 

Ser Pro Gly Arg Arg Phe Val Val Arg Ala Ala Glu Thr Asp Thr Asp 

50 55 60 

Lys Val Lys Ser Gin Thr Pro Asp Lys Ala Pro Ala Gly Gly Ser Ser 
65 70 75 80 

lie Asn Gin Leu Leu Gly lie Lys Gly Ala Ser Gin Glu Thr Asn Lys 

85 90 95 

Trp Lys lie Arg Leu Gin Leu Thr Lys Pro Val Thr Trp Pro Pro Leu 

100 105 110 

Val Trp Gly Val Val Cys Gly Ala Ala Ala Ser Gly Asn Phe His Trp 

115 120 125 

Thr Pro Glu Asp Val Ala Lys Ser lie Leu Cys Met Met Met Ser Gly 

130 135 140 

Pro Cys Leu Thr Gly Tyr Thr Gin Thr lie Asn Asp Trp Tyr Asp Arg 
145 150 155 160 



14 
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Asp lie Asp 

lie Ser Glu 

5 Gly Leu Gly 

195 

Pro Thr Val 

210 
Tyr Ser Ala 
10 225 

Phe Ala Leu 

Ala Leu Phe 
K35 Tyr Ser lie 

■ b *i 

'v? 27 5 

[fj Val Glu Gly 
ill 290 

□ Gly Thr Glu 
;20 305 

1 

Gin Leu Ser 

Ala Leu Ala 

:fi : 5 Lys Tyr Phe 

, MIA 

□ 355 

Ser Ala Gin 

370 
Ser Gin His 
30 385 



Ala He Asn Glu Pro 
165 

Pro Glu Val He Thr 
180 

He Ala Gly He Leu 

200 

Phe Tyr Leu Ala Leu 

215 

Pro Pro Leu Lys Leu 
230 

Gly Ala Ser Tyr He 
245 

Gly Thr Leu Thr Pro 
260 

Ala Gly Leu Gly He 

280 

Asp Arg Ala Leu Gly 

295 

Thr Ala Lys Trp He 
310 

Val Ala Gly Tyr Leu 
325 

Leu Val Ala Leu He 
340 

Leu Lys Asp Pro Val 

360 

Pro Phe Leu Val Leu 

375 



Tyr Arg Pro He 
170 

Gin Val Trp Val 
185 

Asp Val Trp Ala 

Gly Gly Ser Leu 

220 

Lys Gin Asn Gly 
235 

Ser Leu Pro Trp 
250 

Asp Val Val Val 
265 

Ala He Val Asn 

Leu Gin Ser Leu 

300 

Cys Val Gly Ala 
315 

Leu Ala Ser Gly 
330 

He Pro Gin He 
345 

Lys Tyr Asp Val 

Gly He Phe Val 

380 



Pro Ser Gly Ala 
175 

Leu Leu Leu Gly 
190 

Gly His Thr Thr 
205 

Leu Ser Tyr He 

Trp Val Gly Asn 

240 

Trp Ala Gly Gin 
255 

Leu Thr Leu Leu 
270 

Asp Phe Lys Ser 
285 

Pro Val Ala Phe 

He Asp He Thr 

320 

Lys Pro Tyr Tyr 
335 

Val Phe Gin Phe 
350 

Lys Tyr Gin Ala 
365 

Thr Ala Leu Ala 



<210> 18 
<211> 981 
<212> DNA 
35 <213> Arabidopsis sp 



<400> 18 

atgttgttta gtggttcagc gatcccatta agcagcttct gctctcttcc ggagaaaccc 60 

cacactcttc ctatgaaact ctctcccgct gcaatccgat cttcatcctc atctgccccg 120 

4 0 gggtcgttga acttcgatct gaggacgtat tggacgactc tgatcaccga gatcaaccag 180 

aagctggatg aggccatacc ggtcaagcac cctgcgggga tctacgaggc tatgagatac 240 
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tctgtactcg cacaaggcgc caagcgtgcc cctcctgtga tgtgtgtggc ggcctgcgag 300 

ctcttcggtg gcgatcgcct cgccgctttc cccaccgcct gtgccctaga aatggtgcac 360 

gcggcttcgt tgatacacga cgacctcccc tgtatggacg acgatcctgt gcgcagagga 420 

aagccatcta accacactgt ctacggctct ggcatggcca ttctcgccgg tgacgccctc 480 

ttcccactcg ccttccagca cattgtctcc cacacgcctc ctgaccttgt tccccgagcc 540 

accatcctca gactcatcac tgagattgcc cgcactgtcg gctccactgg tatggctgca 600 

ggccagtacg tcgaccttga aggaggtccc tttcctcttt cctttgttca ggagaagaaa 660 

ttcggagcca tgggtgaatg ctctgccgtg tgcggtggcc tattgggcgg tgccactgag 720 

gatgagctcc agagtctccg aaggtacggg agagccgtcg ggatgctgta tcaggtggtc 780 

gatgacatca ccgaggacaa gaagaagagc tatgatggtg gagcagagaa gggaatgatg 840 

gaaatggcgg aagagctcaa ggagaaggcg aagaaggagc ttcaagtgtt tgacaacaag 900 

tatggaggag gagacacact tgttcctctc tacaccttcg ttgactacgc tgctcatcga 960 

cattttcttc ttcccctctg a 9Q1 



<210> 19 
<211> 245 
<212> DNA 
<213> GLycine sp 



<400> 19 

gcaacatctg ggactgggtt tgtcttgggg agtggtagtg ctgttgatct ttcggcactt 60 

tcttgcactt gcttgggtac catgatggtt gctgcatctg ctaactcttt gaatcaggtg 120 

tttgagatca ataatgatgc taaaatgaag agaacaagtc gcaggccact accctcagga 180 

cgcatcacaa tacctcatgc agttggctgg gcatcctctg ttggattagc tggtacggct 240 

ctact 245 



<210> 20 
<211> 253 
<212> DNA 
<213> Glycine sp 



<400> 20 

attggctttc caagatcatt gggttttctt gttgcattca tgaccttcta ctccttgggt 60 

ttggcattgt ccaaggatat acctgacgtt gaaggagata aagagcacgg cattgattct 120 

tttgcagtac gtctaggtca gaaacgggca ttttggattt gcgtttcctt ttttgaaatg 180 

gctttcggag ttggtatcct ggccggagca tcatgctcac acttttggac taaaattttc 240 

acgggtatgg gaa 253 



<210> 21 
<211> 275 
<212> DNA 
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<213> Glycine sp 



<400> 21 

tgatcttcta ctctctgggt atggcattgt ccaaggatat atctgacgtt aaaggagata 60 

aagcatacgg catcgatact ttagcgatac gtttgggtca aaaatgggta ttttggattt 120 

gcattatcct ttttgaaatg gcttttggag ttgccctctt ggcaggagca acatcttctt 180 

acctttggat taaaattgtc acgggtctgg gacatgctat tcttgcttca attctcttgt 240 

accaagccaa atctatatac ttgagcaaca aagtt 275 



<210> 22 
<211> 299 
<212> DNA 
<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (299) 
<223> n — A, T , C or G 



<400> 22 

ccanaatang tncatcttng aaagacaatt ggcctcttca acacacaagt ctgcatgtga 60 

agaagaggcc aattgtcttt ccaagatcac ttatngtggc tattgtaatc atgaacttct 120 

tctttgtggg tatggcattg gcaaaggata tacctanctg ttgaaggaga taaaatatat 180 

ggcattgata cttttgcaat acgtataggt caaaaacaag tattttggat ttgtattttc 240 

ctttttgaaa ggctttcgga gtttccctag tggcaggagc aacatcttct agccttggt 299 



<210> 23 
<211> 767 
<212> DNA 
<213> Glycine sp 



<400> 23 

gtggaggctg tggttgctgc cctgtttatg aatatttata ttgttggttt gaatcaattg 60 

tctgatgttg aaatagacaa gataaacaag ccgtatcttc cattagcatc tggggaatat 120 

tcctttgaaa ctggtgtcac tattgttgca tctttttcaa ttctgagttt ttggcttggc 180 

tgggttgtag gttcatggcc attattttgg gccctttttg taagctttgt gctaggaact 240 

gcttattcaa tcaatgtgcc tctgttgaga tggaagaggt ttgcagtgct tgcagcgatg 300 

tgcattctag ctgttcgggc agtaatagtt caacttgcat ttttccttca catgcagact 360 

catgtgtaca agaggccacc tgtcttttca agaccattga tttttgctac tgcattcatg 420 

agcttcttct ctgtagttat agcactgttt aaggatatac ctgacattga aggagataaa 480 

gtatttggca tccaatcttt ttcagtgtgt ttaggtcaga agccggtgtt ctggacttgt 540 
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gttacccttc ttgaaatagc ttatggagtc gccctcctgg tgggagctgc atctccttgt 600 
ctttggagca aaattttcac gggtctggga cacgctgtgc tggcttcaat tctctggttt 660 
catgccaaat ctgtagattt gaaaagcaaa gcttcgataa catccttcta tatgtttatt 720 
tggaagctat tttatgcaga atacttactc attccttttg ttagatg 767 

<210> 24 
<211> 255 
<212> PRT 
<213> Glycine sp 



<400> 24 

Val Glu Ala Val Val Ala Ala Leu Phe Met Asn lie Tyr lie Val Gly 

15 10 15 

Leu Asn Gin Leu Ser Asp Val Glu lie Asp Lys lie Asn Lys Pro Tyr 
:|5 20 25 30 

if--" 

;j? Leu Pro Leu Ala Ser Gly Glu Tyr Ser Phe Glu Thr Gly Val Thr He 

■ > -4 

h 35 40 45 

\U Val Ala Ser Phe Ser He Leu Ser Phe Trp Leu Gly Trp Val Val Gly 

;f 50 55 60 

; g0 Ser Trp Pro Leu Phe Trp Ala Leu Phe Val Ser Phe Val Leu Gly Thr 
7 65 70 75 80 

^ Ala Tyr Ser He Asn Val Pro Leu Leu Arg Trp Lys Arg Phe Ala Val 
^ 85 90 95 

P 

Leu Ala Ala Met Cys He Leu Ala Val Arg Ala Val He Val Gin Leu 
§5 100 105 110 

;3 Ala Phe Phe Leu His Met Gin Thr His Val Tyr Lys Arg Pro Pro Val 

115 120 125 

Phe Ser Arg Pro Leu He Phe Ala Thr Ala Phe Met Ser Phe Phe Ser 
130 135 140 

30 Val Val He Ala Leu Phe Lys Asp He Pro Asp He Glu Gly Asp Lys 
145 150 155 160 

Val Phe Gly He Gin Ser Phe Ser Val Cys Leu Gly Gin Lys Pro Val 

165 170 175 

Phe Trp Thr Cys Val Thr Leu Leu Glu He Ala Tyr Gly Val Ala Leu 
35 180 185 190 

Leu Val Gly Ala Ala Ser Pro Cys Leu Trp Ser Lys He Phe Thr Gly 

195 200 205 

Leu Gly His Ala Val Leu Ala Ser He Leu Trp Phe His Ala Lys Ser 
210 215 220 

4 0 Val Asp Leu Lys Ser Lys Ala Ser He Thr Ser Phe Tyr Met Phe He 
225 230 235 240 
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Trp Lys Leu Phe Tyr Ala Glu Tyr Leu Leu lie Pro Phe Val Arg 

245 250 255 

<210> 25 
5 <211> 360 
<212> DNA 
<213> Zea sp 

<220> 

10 <221> misc_f eature 

<222> (1) . . . (360) 

<223> n - A,T,C or G 



<400> 25 














-: ; ,J5 ggcgtcttca 


cttgttctgg 


tcttctcgta 


tcccctgatg 


aagaggttca 


cattttggcc 


60 


' : !z tcaggcttat 


cttggcctga 


cattcaactg 


gggagcttta 


ctagggtggg 


ctgctattaa 


120 


Jfi ggaaagcata 


gaccctgcaa 


atcat ccttc 


cattgtatac 


agctggtatt 


tgttggacgc 


180 


I'M tggtgtatga 


tactatatat 


gcgcatcagg 


tgtttcgcta 


tccctacttt 


catattaatc 


240 


cttgatgaag 

T ^ T* 


tggccatttc 


atgttgtcgc 


ggtggtctta 


tacttgcata 


tctccatgca 


300 


; 20 tctcaggaca 


aagangatga 


cctgaaagta 


ggagtccaag 


tccacagctt 


aagatttggg 


360 



: £ <210> 2 6 

3 <211> 299 

''2 <212> DNA 

S5 <213> Zea sp 

h it-" 1 ' 

<220> 

<221> misc_f eature 
<222> (1) . . . (299) 
30 <223> n - A,T,C or G 

<400> 26 

gatggttgca gcatctgcaa ataccctcaa ccaggtgttt gngataaaaa atgatgctaa 60 

aatgaaaagg acaatgcgtg ccccctgcca tctggtcgca ttagtcctgc acatgctgcg 120 

35 atgtgggcta caagtgttgg agttgcagga acagctttgt tggcctggaa ggctaatggc 180 

ttggcagctg ggcttgcagc ttctaatctt gttctgtatg catttgtgta tacgccgttg 240 

aagcaaatac accctgttaa tacatgggtt ggggcagtcg ttggtgccat cccaccact 299 

<210> 27 
40 <211> 255 
<212> DNA 
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<213> Zea sp 
<220> 

<221> misc feature 
5 <222> (1) . . . (255) 

<223> n - A,T,C or G 



<400> 27 

anacttgcat atctccatgc ntctcaggac aaagangatg acctgaaagt aggtgtcaag 60 

10 tccacagcat taagatttgg agatttgacc nnatactgna tcagtggctt tggcgcggca 120 

tgcttcggca gcttagcact cagtggttac aatgctgacc ttggttggtg tttagtgtga 180 

tgcttgagcg aagaatggta tngtttttac ttgatattga ctccagacct gaaatcatgt 240 

tggacagggt ggccc 255 

15 <210> 28 
y <211> 257 
<212> DNA 

mr »- 

y <213> Zea sp 

20 <400> 28 

attgaagggg ataggactct ggggcttcag tcacttcctg ttgcttttgg gatggaaact 60 
^ gcaaaatgga tttgtgttgg agcaattgat atcactcaat tatctgttgc aggttaccta 120 
•J ttgagcaccg gtaagctgta ttatgccctg gtgttgcttg ggctaacaat tcctcaggtg 180 
Z ttctttcagt tccagtactt cctgaaggac cctgtgaagt atgatgtcaa atatcaggca 240 
25 agcgcacaac cattctt 257 



<210> 29 

<211> 368 

<212> DNA 

30 <213> Zea sp 



35 



40 



<400> 29 



at ccagttgc 


aaataataat 


ggcgttcttc 


tctgttgtaa 


tagcactatt 


caaggatata 


60 


cctgacatcg 


aaggggaccg 


catattcggg 


atccgatcct 


tcagcgtccg 


gttagggcaa 


120 


aagaaggtct 


tttggatctg 


cgttggcttg 


cttgagatgg 


cctacagcgt 


tgcgatactg 


180 


atgggagcta 


cctctt cctg 


tttgtggagc 


aaaacagcaa 


ccatcgctgg 


ccattccata 


240 


cttgccgcga 


tcctatggag 


ctgcgcgcga 


tcggtggact 


tgacgagcaa 


agccgcaata 


300 


acgtccttct 


acatgttcat 


ctggaagctg 


ttctacgcgg 


agtacctgct 


catccctctg 


360 


gtgcggtg 












368 



<210> 30 
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<211> 122 
<212> PRT 
<213> Zea sp 



<400> 30 

■He Gin Leu Gin He He Met Ala Phe Phe Ser Val Val He Ala Leu 

15 10 15 

Phe Lys Asp He Pro Asp He Glu Gly Asp Arg He Phe Gly He Arg 

20 25 30 

Ser Phe Ser Val Arg Leu Gly Gin Lys Lys Val Phe Trp He Cys Val 

35 40 45 

Gly Leu Leu Glu Met Ala Tyr Ser Val Ala He Leu Met Gly Ala Thr 

50 55 60 

Ser Ser Cys Leu Trp Ser Lys Thr Ala Thr He Ala Gly His Ser He 
65 70 75 80 

Leu Ala Ala He Leu Trp Ser Cys Ala Arg Ser Val Asp Leu Thr Ser 

85 90 95 

Lys Ala Ala He Thr Ser Phe Tyr Met Phe He Trp Lys Leu Phe Tyr 

100 105 110 

Ala Glu Tyr Leu Leu He Pro Leu Val Arg 

115 120 



<210> 31 
<211> 278 
<212> DNA 
<213> Zea sp 



<400> 31 














tattcagcac 


cacctctcaa 


gctcaagcag 


aatggatgga 


ttgggaactt 


cgctctgggt 


60 


gcgagttaca 


tcagcttgcc 


ctggtgggct 


ggccaggcgt 


tatttggaac 


tcttacacca 


120 


gatatcattg 


tcttgactac 


tttgtacagc 


atagctgggc 


tagggattgc 


tattgtaaat 


180 


gatttcaaga 


gtattgaagg 


ggataggact 


ctggggcttc 


agtcacttcc 


tgttgctttt 


240 


gggatggaaa 


ctgcaaaatg 


gatttgtgtt 


ggagcaat 






278 



<210> 32 
<211> 292 
<212> PRT 

<213> Synechocystis sp 



<400> 32 

Met Val Ala Gin Thr Pro Ser Ser Pro Pro Leu Trp Leu Thr He He 
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1 

Tyr Leu Leu Arg 

20 

Pro Ala Leu Trp 
35 

Pro Leu Leu Gly 
50 

Gly Cys Val Val 
65 

Glu Arg Thr Lys 

Val Gly He Gly 

100 

Ala Phe Tyr Leu 

115 

Pro Val He Val 
130 

Gin Leu Val Leu 
145 

Ser Ala Val Thr 

Ala Thr Val Phe 

180 

Asp Arg Glu Asp 

195 

Phe Gly Gin Tyr 
210 

He Gly Cys Leu 
225 

Tyr Trp Leu Ser 

He Gin Leu Ser 

260 

Phe Gly Gin Asn 
275 

Leu Gly Trp Leu 
290 



5 

Trp His Lys Pro 

Ala Val Cys Leu 

40 

Thr He Ala Leu 
55 

Asn Asp Leu Trp 
70 

Gin Arg Pro Leu 
85 

Val Ala Leu Val 

Thr Pro Leu Ser 

120 

Ala Tyr Pro Gly 

135 

Ser He Ala Trp 
150 

Gly Asp Leu Thr 
165 

Trp Thr Leu Gly 

Asp Arg Arg He 

200 

Val Gly Glu Ala 
215 

Phe Tyr Leu Gly 
230 

Leu Ala He Ala 
245 

Ala Pro Thr Pro 

Val He He Gly 

280 



10 

Ala Gly Arg Leu 
25 

Ala Ala Gin Gly 

Gly Thr Leu Ala 

60 

Asp Arg Asp He 

75 

Ala Ala Arg Ala 
90 

Ala Leu Leu Cys 
105 

Phe Trp Leu Cys 

Ala Lys Arg Val 

140 

Gly Phe Ala Val 

155 

Asp Ala Thr Trp 
170 

Phe Asp Thr Val 
185 

Gly Val Asn Ser 

Val Gly He Phe 

220 

Met He Leu Met 

235 

He Val Gly Trp 
250 

Glu Pro Lys Leu 
265 

Phe Val Leu Leu 



15 

He Leu Met He 
30 

Leu Pro Pro Leu 
45 

Thr Ser Gly Leu 

Asp Pro Gin Val 

80 

Leu Ser Val Gin 

95 

Ala Ala Gly Leu 
110 

Val Ala Ala Val 
125 

Phe Pro Val Pro 

Leu He Ser Trp 

160 

Val Leu Trp Gly 

175 

Tyr Ala Met Ala 
190 

Ser Ala Leu Phe 
205 

Phe Ala Leu Thr 

Leu Asn Pro Leu 

240 

Val He Gin Tyr 
255 

Tyr Gly Gin He 
270 

Ala Gly Met Leu 
285 



<210> 33 
<211> 316 
<212> PRT 
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<400> 33 

Met Val Thr Ser Thr Lys lie His Arg Gin His Asp Ser Met Gly Ala 
5 1 5 10 15 

Val Cys Lys Ser Tyr Tyr Gin Leu Thr Lys Pro Arg lie lie Pro Leu 

20 25 30 

Leu Leu lie Thr Thr Ala Ala Ser Met Trp lie Ala Ser Glu Gly Arg 
35 40 45 

10 Val Asp Leu Pro Lys Leu Leu lie Thr Leu Leu Gly Gly Thr Leu Ala 

50 55 60 

Ala Ala Ser Ala Gin Thr Leu Asn Cys lie Tyr Asp Gin Asp lie Asp 
65 70 75 80 

Tyr Glu Met Leu Arg Thr Arg Ala Arg Pro lie Pro Ala Gly Lys Val 
f.5 85 90 95 

% Gin Pro Arg His Ala Leu lie Phe Ala Leu Ala Leu Gly Val Leu Ser 
n 100 105 110 

0 Phe Ala Leu Leu Ala Thr Phe Val Asn Val Leu Ser Gly Cys Leu Ala 

1 115 120 125 

IF *> 
++ *T 

20 Leu Ser Gly lie Val Phe Tyr Met Leu Val Tyr Thr His Trp Leu Lys 

130 135 140 

* Arg His Thr Ala Gin Asn He Val He Gly Gly Ala Ala Gly Ser He 
^ 145 150 155 160 

3s Pro Pro Leu Val Gly Trp Ala Ala Val Thr Gly Asp Leu Ser Trp Thr 

|5 165 170 175 

3 Pro Trp Val Leu Phe Ala Leu He Phe Leu Trp Thr Pro Pro His Phe 

180 185 190 

Trp Ala Leu Ala Leu Met He Lys Asp Asp Tyr Ala Gin Val Asn Val 
195 200 205 

30 Pro Met Leu Pro Val He Ala Gly Glu Glu Lys Thr Val Ser Gin He 

210 215 220 

Trp Tyr Tyr Ser Leu Leu Val Val Pro Phe Ser Leu Leu Leu Val Tyr 
225 230 235 240 

Pro Leu His Gin Leu Gly He Leu Tyr Leu Ala He Ala He He Leu 
35 245 250 255 

Gly Gly Gin Phe Leu Val Lys Ala Trp Gin Leu Lys Gin Ala Pro Gly 

260 265 270 

Asp Arg Asp Leu Ala Arg Gly Leu Phe Lys Phe Ser He Phe Tyr Leu 
275 280 285 

4 0 Met Leu Leu Cys Leu Ala Met Val He Asp Ser Leu Pro Val Thr His 

290 295 300 
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Gin Leu Val Ala Gin Met Gly Thr Leu Leu Leu Gly 
305 310 315 



<210> 34 
5 <211> 324 
<212> PRT 

<213> Synechocystis sp 
<400> 34 

10 Met Ser Asp Thr Gin Asn Thr Gly Gin Asn Gin Ala Lys Ala Arg Gin 
15 10 15 

Leu Leu Gly Met Lys Gly Ala Ala Pro Gly Glu Ser Ser He Trp Lys 

20 25 30 

He Arg Leu Gin Leu Met Lys Pro He Thr Trp He Pro Leu He Trp 
3.5 35 4 0 4 5 

Z G1 Y Val v ^l Cys Gly Ala Ala Ser Ser Gly Gly Tyr He Trp Ser Val 

M Ft 

n 5 0 55 60 

Q Glu Asp Phe Leu Lys Ala Leu Thr Cys Met Leu Leu Ser Gly Pro Leu 

3 65 70 75 80 

20 Met Thr Gly Tyr Thr Gin Thr Leu Asn Asp Phe Tyr Asp Arg Asp He 

85 90 95 

£ Asp Ala He Asn Glu Pro Tyr Arg Pro He Pro Ser Gly Ala He Ser 
3 100 105 no 

~ Val Pro Gin Val Val Thr Gin He Leu He Leu Leu Val Ala Gly He 
25 115 120 125 

■■ ■. 
hi <i 

3 Gly Val Ala Tyr Gly Leu Asp Val Trp Ala Gin His Asp Phe Pro He 

130 135 140 

Met Met Val Leu Thr Leu Gly Gly Ala Phe Val Ala Tyr He Tyr Ser 
145 150 155 160 

3 0 Ala Pro Pro Leu Lys Leu Lys Gin Asn Gly Trp Leu Gly Asn Tyr Ala 

165 170 175 

Leu Gly Ala Ser Tyr He Ala Leu Pro Trp Trp Ala Gly His Ala Leu 

180 185 190 

Phe Gly Thr Leu Asn Pro Thr He Met Val Leu Thr Leu He Tyr Ser 
35 195 200 205 

Leu Ala Gly Leu Gly He Ala Val Val Asn Asp Phe Lys Ser Val Glu 

210 215 220 

Gly Asp Arg Gin Leu Gly Leu Lys Ser Leu Pro Val Met Phe Gly He 
225 230 235 240 

4 0 Gly Thr Ala Ala Trp He Cys Val He Met He Asp Val Phe Gin Ala 

245 250 255 
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Gly lie Ala Gly Tyr Leu lie Tyr Val His Gin Gin Leu Tyr Ala Thr 

260 265 270 

lie Val Leu Leu Leu Leu lie Pro Gin lie Thr Phe Gin Asp Met Tyr 

275 280 285 

Phe Leu Arg Asn Pro Leu Glu Asn Asp Val Lys Tyr Gin Ala Ser Ala 

290 295 300 

Gin Pro Phe Leu Val Phe Gly Met Leu Ala Thr Gly Leu Ala Leu Gly 
305 310 315 320 

His Ala Gly He 



<210> 35 
<211> 307 
<212> PRT 

<213> Synechocyst is sp 



<400> 35 
Met Thr Glu Ser 
1 

Lys Leu Trp Leu 

20 

Val Pro He Thr 

35 

Trp His Gly Asp 
50 

He Ala Trp He 
65 

He Asp Val Arg 

Asn Leu Val Phe 

100 

Gly Leu Met Ser 
115 

Leu He Gly Val 
130 

Phe Arg Leu Gly 
145 

Phe Gly Pro Leu 

Phe Ser Trp Asn 

180 



Ser Pro Leu Ala 
5 

Ala Ala He Lys 

Val Gly Ser Ala 

40 

Val Phe Thr He 
55 

Asn Leu Ser Asn 
70 

Lys Ala His Ser 
85 

Leu He Ser Asn 

Met Ser Trp Arg 

120 

Ala He Phe Leu 

135 

Tyr Leu Gly Leu 
150 

Ala He Ala Ala 
165 

Leu Leu Thr Pro 



Pro Ser Thr Ala 

10 

Pro Pro Met Tyr 
25 

Val Ala Tyr Gly 

Phe Leu Leu Ser 

60 

Asp Val Phe Asp 

75 

Val Val Asn Leu 
90 

Phe Phe Leu Leu 
105 

Ala Gin Asp Trp 

Gly Tyr Thr Tyr 

140 

Gly Glu Leu He 

155 

Ala Tyr Tyr Ser 
170 

Ser Val Phe Val 
185 



Pro Ala Thr Arg 
15 

Thr Val Ala Val 
30 

Leu Thr Gly Gin 
45 

Ala He Ala He 

Ser Asp Thr Gly 

80 

Thr Gly Asn Arg 
95 

Ala Gly Val Leu 
110 

Thr Val Leu Glu 

125 

Gin Gly Pro Pro 

Cys Leu He Thr 

160 

Gin Ser Gin Ser 

175 

Gly He Ser Thr 
190 



25 
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Ala lie lie Leu Phe Cys Ser His Phe His Gin Val Glu Asp Asp Leu 

195 200 1 205 

Ala Ala Gly Lys Lys Ser Pro lie Val Arg Leu Gly Thr Lys Leu Gly 

210 215 220 

Ser Gin Val Leu Thr Leu Ser Val Val Ser Leu Tyr Leu lie Thr Ala 
225 230 235 240 

lie Gly Val Leu Cys His Gin Ala Pro Trp Gin Thr Leu Leu lie lie 

245 250 255 

Ala Ser Leu Pro Trp Ala Val Gin Leu lie Arg His Val Gly Gin Tyr 

260 265 270 

His Asp Gin Pro Glu Gin Val Ser Asn Cys Lys Phe lie Ala Val Asn 

275 280 285 

Leu His Phe Phe Ser Gly Met Leu Met Ala Ala Gly Tyr Gly Trp Ala 

290 295 300 

Gly Leu Gly 
305 



<210> 36 
<211> 927 
<212> DNA 

<213> Synechocystis sp 



<400> 36 

atggcaacta tccaagcttt ttggcgcttc tcccgccccc ataccatcat tggtacaact 60 

ctgagcgtct gggctgtgta tctgttaact attctcgggg atggaaactc agttaactcc 120 

cctgcttccc tggatttagt gttcggcgct tggctggcct gcctgttggg taatgtgtac 180 

attgtcggcc tcaaccaatt gtgggatgtg gacattgacc gcatcaataa gccgaatttg 240 

cccctagcta acggagattt ttctatcgcc cagggccgtt ggattgtggg actttgtggc 300 

gttgcttcct tggcgatcgc ctggggatta gggctatggc tggggctaac ggtgggcatt 360 

agtttgatta ttggcacggc ctattcggtg ccgccagtga ggttaaagcg cttttccctg 420 

ctggcggccc tgtgtattct gacggtgcgg ggaattgtgg ttaacttggg cttattttta 480 

ttttttagaa ttggtttagg ttatcccccc actttaataa cccccatctg ggttttgact 540 

ttatttatct tagttttcac cgtggcgatc gccattttta aagatgtgcc agatatggaa 600 

ggcgatcggc aatttaagat tcaaacttta actttgcaaa tcggcaaaca aaacgttttt 660 

cggggaacct taattttact cactggttgt tatttagcca tggcaatctg gggcttatgg 720 

gcggctatgc ctttaaatac tgctttcttg attgtttccc atttgtgctt attagcctta 780 

ctctggtggc ggagtcgaga tgtacactta gaaagcaaaa ccgaaattgc tagtttttat 840 

cagtttattt ggaagctatt tttcttagag tacttgctgt atcccttggc tctgtggtta 900 

cctaattttt ctaatactat tttttag 927 



<210> 37 
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<211> 308 
<212> PRT 

<213> Synechocystis sp 



5 <400> 37 

Met Ala Thr He Gin Ala Phe Trp Arg Phe Ser Arg Pro His Thr He 

15 10 15 

He Gly Thr Thr Leu Ser Val Trp Ala Val Tyr Leu Leu Thr He Leu 

20 25 30 

10 Gly Asp Gly Asn Ser Val Asn Ser Pro Ala Ser Leu Asp Leu Val Phe 

35 40 45 

Gly Ala Trp Leu Ala Cys Leu Leu Gly Asn Val Tyr He Val Gly Leu 

50 55 60 

Asn Gin Leu Trp Asp Val Asp He Asp Arg He Asn Lys Pro Asn Leu 
15 65 70 75 80 

t Pro Leu Ala Asn Gly Asp Phe Ser He Ala Gin Gly Arg Trp He Val 

85 90 95 

y Gly Leu Cys Gly Val Ala Ser Leu Ala He Ala Trp Gly Leu Gly Leu 

3 100 105 110 

i|0 Trp Leu Gly Leu Thr Val Gly He Ser Leu He He Gly Thr Ala Tyr 

115 120 125 

£ Ser Val Pro Pro Val Arg Leu Lys Arg Phe Ser Leu Leu Ala Ala Leu 

3 130 135 140 

~ Cys He Leu Thr Val Arg Gly He Val Val Asn Leu Gly Leu Phe Leu 

25 145 150 155 160 

3 Phe Phe Arg He Gly Leu Gly Tyr Pro Pro Thr Leu He Thr Pro He 

165 170 175 

Trp Val Leu Thr Leu Phe He Leu Val Phe Thr Val Ala He Ala He 

180 185 190 

30 Phe Lys Asp Val Pro Asp Met Glu Gly Asp Arg Gin Phe Lys He Gin 

195 200 205 

Thr Leu Thr Leu Gin He Gly Lys Gin Asn Val Phe Arg Gly Thr Leu 

210 215 220 

He Leu Leu Thr Gly Cys Tyr Leu Ala Met Ala He Trp Gly Leu Trp 
35 225 230 235 240 

Ala Ala Met Pro Leu Asn Thr Ala Phe Leu He Val Ser His Leu Cys 

245 250 255 

Leu Leu Ala Leu Leu Trp Trp Arg Ser Arg Asp Val His Leu Glu Ser 

260 265 270 

4 0 Lys Thr Glu He Ala Ser Phe Tyr Gin Phe He Trp Lys Leu Phe Phe 

275 280 285 
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Leu Glu Tyr Leu Leu Tyr Pro Leu Ala Leu Trp Leu Pro Asn Phe Ser 

290 295 300 

Asn Thr lie Phe 

■ 305 

5 

<210> 38 
<211> 1092 
<212> DNA 

<213> Synechocystis sp 

10 

<400> 38 





' — l 4 — /"*r — s — \ -4 — 4— T — f — t 


cgcccca.cag 


t ggttaccat 


tggcaaggt c 


i iii 

aatcaccttt 


ctttgaaggt 


60 




LggLdcgLgc 


gcct gctt tt 


gccccaat cc 


ggggaaagtt 


ttgcttttat 


gtactccatc 


120 




gaaaatcctg 


ctagcgatca 


_i _i j 

tcattacggc 


ggcggtgctg 


tgcaaatttt 


agggccggct 


180 


J- 5 


acgaaaaaac 


aagaaaat ca 


ggaagaccaa 


cttgtttggc 


ggacatttcc 


ctcggtaaaa 


240 


Fb.F#i 

•V'"t 


aaattttggg 


ccagtcctcg 


ccagtttgcc 


ctagggcatt 


ggggaaaatg 


tagggataac 


300 




aggcaggcga 


aacccctact 


ctccgaagaa 


ttttttgcca 


cggtcaagga 


aggttatcaa 


360 


4 H 


atccatcaaa 


atcagcacca 


aggacaaatc 


attcatggcg 


atcgccattg 


tcgttggcag 


420 


r)Ull 


ttcaccgtag 


aaccggaagt 


aacttggggg 


agtcctaacc 


gatttcctcg 


ggctacagcg 


480 




ggttggcttt 


cctttttacc 


cttgtttgat 


cccggttggc 


aaattctttt 


agcccaaggt 


540 


**• 


agagcgcacg 


gctggctgaa 


atggcagagg 


gaacagtatg 


aatttgacca 


cgccctagtt 


600 




tatgccgaaa 


aaaattgggg 


tcactccttt 


ccctcccgct 


ggttttggct 


ccaagcaaat 


660 


hM>» 


tattttcctg 


accatccagg 


actgagcgt c 


actgccgctg 


gcggggaacg 


gattgttctt 


720 


hF»II 


ggtcgccccg 


aagaggtagc 


tttaattggc 


ttacatcacc 


aaggtaattt 


ttacgaattt 


780 




ggcccgggcc 


atggcacagt 


cacttggcaa 


gtagctccct 


ggggccgttg 


gcaattaaaa 


840 


■*IM-"'1- 


gccagcaatg 


ataggtattg 


ggtcaagttg 


tccggaaaaa 


cagataaaaa 


aggcagttta 


900 




gtccacactc 


ccaccgccca 


gggcttacaa 


ct caactgcc 


gagataccac 


taggggctat 


960 




ttgtatttgc 


aattgggat c 


tgtgggtcac 


ggcctgatag 


tgcaagggga 


aacggacacc 


1020 




gcggggctag 


aagttggagg 


tgattggggt 


ttaacagagg 


aaaatttgag 


caaaaaaaca 


1080 


30 


gtgccattct 


ga 










1092 



<210> 39 

<211> 363 

<212> PRT 

35 <213> Synechocystis sp 



<400> 39 

Met Lys Phe Pro Pro His Ser Gly Tyr His Trp Gin Gly Gin Ser Pro 

15 10 15 

Phe Phe Glu Gly Trp Tyr Val Arg Leu Leu Leu Pro Gin Ser Gly Glu 

20 25 30 
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Ser Phe Ala Phe Met Tyr Ser He Glu Asn Pro Ala Ser Asp His His 

35 40 45 

Tyr Gly Gly Gly Ala Val Gin He Leu Gly Pro Ala Thr Lys Lys Gin 
50 55 60 

5 Glu Asn Gin Glu Asp Gin Leu Val Trp Arg Thr Phe Pro Ser Val Lys 
65 70 75 80 

Lys Phe Trp Ala Ser Pro Arg Gin Phe Ala Leu Gly His Trp Gly Lys 

85 90 95 

Cys Arg Asp Asn Arg Gin Ala Lys Pro Leu Leu Ser Glu Glu Phe Phe 
10 100 105 110 

Ala Thr Val Lys Glu Gly Tyr Gin He His Gin Asn Gin His Gin Gly 

115 120 125 

Gin He He His Gly Asp Arg His Cys Arg Trp Gin Phe Thr Val Glu 
130 135 140 

15 Pro Glu Val Thr Trp Gly Ser Pro Asn Arg Phe Pro Arg Ala Thr Ala 
3 145 150 155 160 

^ Gly Trp Leu Ser Phe Leu Pro Leu Phe Asp Pro Gly Trp Gin He Leu 

g 165 170 175 

3 Leu Ala Gin Gly Arg Ala His Gly Trp Leu Lys Trp Gin Arg Glu Gin 
JO 180 185 190 

Tyr Glu Phe Asp His Ala Leu Val Tyr Ala Glu Lys Asn Trp Gly His 
* 195 200 205 

3 Ser Phe Pro Ser Arg Trp Phe Trp Leu Gin Ala Asn Tyr Phe Pro Asp 
;i 210 215 220 

'|< WH 

S25 His Pro Gly Leu Ser Val Thr Ala Ala Gly Gly Glu Arg He Val Leu 
| 225 230 235 240 

Gly Arg Pro Glu Glu Val Ala Leu He Gly Leu His His Gin Gly Asn 

245 250 255 

Phe Tyr Glu Phe Gly Pro Gly His Gly Thr Val Thr Trp Gin Val Ala 
30 260 265 270 

Pro Trp Gly Arg Trp Gin Leu Lys Ala Ser Asn Asp Arg Tyr Trp Val 

275 280 285 

Lys Leu Ser Gly Lys Thr Asp Lys Lys Gly Ser Leu Val His Thr Pro 
290 295 300 

35 Thr Ala Gin Gly Leu Gin Leu Asn Cys Arg Asp Thr Thr Arg Gly Tyr 
305 310 315 320 

Leu Tyr Leu Gin Leu Gly Ser Val Gly His Gly Leu He Val Gin Gly 

325 330 335 

Glu Thr Asp Thr Ala Gly Leu Glu Val Gly Gly Asp Trp Gly Leu Thr 
40 340 345 350 

Glu Glu Asn Leu Ser Lys Lys Thr Val Pro Phe 
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355 



360 



<210> 40 



<211> 56 



<212> DNA 



<213> Artifical Sequence 



<400> 40 



cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaaat 



56 



<210> 41 
<211> 32 
<212> DNA 

<213> Artifical Sequence 
<400> 41 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 42 
<211> 32 
<212> DNA 

<213> Artifical Sequence 



<210> 43 
<211> 32 
<212> DNA 

<213> Artifical Sequence 
<400> 43 

tcgacctgca ggaagcttgc ggccgcggat cc 32 

<210> 44 
<211> 32 
<212> DNA 

<213> Artifical Sequence 



<400> 42 



tcgacctgca ggaagcttgc ggccgcggat cc 



32 



<400> 44 



tcgaggatcc gcggccgcaa gcttcctgca gg 



32 
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<210> 45 
<211> 36 
<212> DNA 

<213> Artifical Sequence 
<400> 45 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 36 

<210> 46 
<211> 28 
<212> DNA 

<213> Artifical Sequence 
<400> 46 

cctgcaggaa gcttgcggcc gcggatcc 28 

<210> 47 
<211> 36 
<212> DNA 

<213> Artifical Sequence 
<400> 47 

tcgacctgca ggaagcttgc ggccgcggat ccagct 36 

<210> 48 
<211> 28 
<212> DNA 

<213> Artifical Sequence 
<400> 48 

ggatccgcgg ccgcaagctt cctgcagg 28 

<210> 49 
<211> 39 
<212> DNA 

<213> Artifical Sequence 
<400> 49 

gatcacctgc aggaagcttg cggccgcgga tccaatgca 39 



31 



<210> 50 
<211> 31 
<212> DNA 
■ <213> Artifical Sequence 

5 

<400> 50 

ttggatccgc ggccgcaagc ttcctgcagg t 

<210> 51 
10 <211> 41 
; <212> DNA 

<213> Artifical Sequence 

<400> 51 

;;15 ggatccgcgg ccgcacaatg gagtctctgc tctctagttc t 41 

. -^r,t* 

. If*.** 

;LJ <210> 52 
;0 <211> 38 

1 ELM- 

□ <212> DNA 

§0 <213> Artifical Sequence 

^ <400> 52 

; J ggatcctgca ggtcacttca aaaaaggtaa cagcaagt 38 

■ w-t 

S5 <21C> 53 
5 <211> 45 
<212> DNA 

<213> Artifical Sequence 
30 <400> 53 

ggatccgcgg ccgcacaatg gcgttttttg ggctctcccg tgttt 4 5 

<210> 54 
<211> 40 
35 <212> DNA 

<213> Artifical Sequence 

<400> 54 

ggatcctgca ggttattgaa aacttcttcc aagtacaact 40 

40 

<210> 55 
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31 
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<211> 38 



<212> DNA 



<213> Artifical Sequence 



<400> 55 



ggatccgcgg ccgcacaatg tggcgaagat ctgttgtt 



38 



<210> 56 
<211> 37 
<212> DNA 

<213> Artifical Sequence 
<400> 56 

ggatcctgca ggtcatggag agtagaagga aggagct 37 

<210> 57 
<211> 50 
<212> DNA 

<213> Artifical Sequence 



<210> 58 
<211> 38 
<212> DNA 

<213> Artifical Sequence 
<400> 58 

ggatcctgca ggtcacttgt ttctggtgat gactctat 38 

<210> 59 
<211> 38 
<212> DNA 

<213> Artifical Sequence 



<400> 57 



ggatccgcgg ccgcacaatg gtacttgccg aggttccaaa gcttgcctct 



50 



<400> 59 



ggatccgcgg ccgcacaatg acttcgattc tcaacact 



38 



<210> 60 



<211> 36 
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<212> DNA 

<213> Artifical Sequence 
<400> 60 

ggatcctgca ggtcagtgtt gcgatgctaa tgccgt 35 

<210> 61 
<211> 22 
<212> DNA 

<213> Artifical Sequence 
<400> 61 

taatgtgtac attgtcggcc tc 22 

<210> 62 
<211> 60 
<212> DNA 

<213> Artifical Sequence 
<400> 62 

gcaatgtaac atcagagatt ttgagacaca acgtggcttt ccacaattcc ccgcaccgtc 60 

<210> 63 
<211> 22 
<212> DNA 

<213> Artifical Sequence 
<400> 63 

aggctaataa gcacaaatgg ga 22 

<210> 64 
<211> 63 
<212> DNA 

<213> Artifical Sequence 
<400> 64 

ggtatgagtc agcaacacct tcttcacgag gcagacctca gcggaattgg tttaggttat 60 
ccc 63 

<210> 65 
<211> 26 
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<212> DNA 

<213> Artifical Sequence 
<400> 65 

ggatccatgg ttgcccaaac cccatc 26 

<210> 66 
<211> 61 
<212> DNA 

<213> Artifical Sequence 



<400> 66 

gcaatgtaac atcagagatt ttgagacaca acgtggcttt gggtaagcaa caatgaccgg 60 

61 



c 



<210> 67 
<211> 25 
<212> DNA 

<213> Artifical Sequence 
<400> 67 

gaattctcaa agccagccca gtaac 

<210> 68 
<211> 63 
<212> DNA 

<213> Artifical Sequence 



<400> 68 

ggtatgagtc agcaacacct tcttcacgag gcagacctca gcgggtgcga aaagggtttt 60 

63 



ccc 



<210> 69 
<211> 23 
<212> DNA 

<213> Artifical Sequence 
<400> 69 

ccagtggttt aggctgtgtg gtc 



<210> 70 
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<211> 21 
<212> DNA 

<213> Artifical Sequence 
5 <400> 70 

ctgagttgga tgtattggat c 21 



<210> 71 
<211> 28 
10 <212> DNA 

<213> Artifical Sequence 



15 



20 



<400> 71 

ggatccatgg ttacttcgac aaaaatcc 28 

<210> 72 
<211> 60 
<212> DNA 

<213> Artifical Sequence 
<400> 72 

gcaatgtaac atcagagatt ttgagacaca acgtggcttt gctaggcaac cgcttagtac 60 

<210> 73 
5 <211> 28 
<212> DNA 

<213> Artifical Sequence 



<400> 73 

30 gaattcttaa cccaacagta aagttccc 28 



<210> 74 
<211> 63 
<212> DNA 
35 <213> Artifical Sequence 

<400> 74 

ggtatgagtc agcaacacct tcttcacgag gcagacctca gcgccggcat tgtcttttac 60 
atg 63 



40 



<210> 75 



36 



<211> 20 
<212> DNA 

<213> Artifical Sequence 

5 <400> 75 

ggaacccttg cagccgcttc 

<210> 76 
<211> 22 
10' <212> DNA 

<213> Artifical Sequence 

<400> 76 

gtatgcccaa ctggtgcaga gg 

3.5 

3 <210> 77 

:^ <211> 28 

<212> DNA 
;3 <213> Artifical Sequence 

lo 

: ^ <400> 77 

^ ggatccatgt ctgacacaca aaataccg 

; f <210> 7 8 
25 <211> 62 
^ <212> DNA 

<213> Artifical Sequence 

<400> 78 

30 gcaatgtaac atcagagatt ttgagacaca 
ag 

<210> 79 
<211> 27 
35 <212> DNA 

<213> Artifical Sequence 

<400> 79 

gaattctcaa atccccgcat ggcctag 

40 

<210> 80 
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20 



22 



28 



acgtggcttt cgccaatacc agccaccaac 60 

62 



37 



Attorney Docket No: 

52015/01/US 

<211> 65 

<212> DNA 

<213> Artifical Sequence 
<400> 80 

ggtatgagtc agcaacacct tcttcacgag gcagacctca gcggcctacg gcttggacgt 60 
gtggg 65 

<210> 81 
<211> 21 
<212> DNA 

<213> Artifical Sequence 



<400> 81 

cacttggatt cccctgatct g 

<210> 82 
<211> 21 
<212> DNA 

<213> Artifical Sequence 
<400> 82 

gcaatacccg cttggaaaac g 

<210> 83 
<211> 29 
<212> DNA 

<213> Artifical Sequence 
<400> 83 

ggatccatga ccgaatcttc gcccctagc 

<210> 84 
<211> 61 
<212> DNA 

<213> Artifical Sequence 



<400> 84 

gcaatgtaac atcagagatt ttgagacaca acgtggcttt caatcctagg tagccgaggc 60 

61 



g 
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<210> 85 
<211> 27 
<212> DNA 

<213> Artifical Sequence 
<400> 85 

gaattcttag cccaggccag cccagcc 

<210> 86 
<211> 66 
<212> DNA 

<213> Artifical Sequence 



<400> 86 

ggtatgagtc agcaacacct tcttcacgag gcagacctca gcggggaatt gatttgttta 60 

attaCC rr 



<210> 87 
<211> 21 
<212> DNA 

<213> Artifical Sequence 
<400> 87 

gcgatcgcca ttatcgcttg g 

<210> 88 
<211> 24 
<212> DNA 

<213> Artifical Sequence 
<400> 88 

gcagactggc aattatcagt aacg 

<210> 89 
<211> 25 
<212> DNA 

<213> Artifical Sequence 
<400> 89 

ccatggattc gagtaaagtt gtcgc 



39 
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<210> 90 
<211> 0 

<213> Artifical Sequence 
<400> 90 

gaattcactt caaaaaaggt aacag 

<210> 91 
<211> 4550 
<212> DNA 

<213> Arabidopsis sp 
<400> 91 

attttacacc aatttgatca cttaactaaa ttaattaaat tagatgatta tcccaccata 60 

tttttgagca ttaaaccata aaaccatagt tataagtaac tgttttaatc gaatatgact 120 

cgattaagat taggaaaaat ttataaccgg taattaagaa aacattaacc gtagtaaccg 180 

taaatgccga ttcctccctt gtctaaaaga cagaaaacat atattttatt ttgccccata 240 

tgtttcactc tatttaattt caggcacaat acttttggtt ggtaacaaaa ctaaaaagga 300 

caacacgtga tacttttcct cgtccgtcag tcagattttt tttaaactag aaacaagtgg 360 

caaatctaca ccacattttt tgcttaatct attaacttgt aagttttaaa ttcctaaaaa 420 

agtctaacta attcttctaa tataagtaca ttccctaaat ttcccaaaaa gtcaaattaa 480 

taattttcaa aatctaatct aaatatctaa taattcaaaa tcattaaaaa gacacgcaac 540 

aatgacacca attaatcatc ctcgacccac acaattctac agttctcatg ctaaaccata 600 

ttttttgctc tctgttcctt caaaatcatt tctttctctt ctttgattcc caaagatcac 660 

ttctttgtct ttgatttttg attttttttc tctctggcgt gaaggaagaa gctttatttc 720 

atggagtctc tgctctctag ttcttctctt gtttccgctg gtaaatctcg tccttttctg 780 

gtttcaggtt ttatttgttg tttaggtttc gtttttgtga ttcagaacca tacaaaaagt 840 

ttgaactttt ctgaatataa aataaggaaa aagtttcgat ttttataatg aattgtttac 900 

tagatcgaag taggtgacaa aggttattgt gtggagaagc ataatttctg ggcttgactt 960 

tgaattttgt ttctcatgca tgcaacttat caatcagctg gtgggttttg ttggaagaag 1020 

cagaatctaa agctccactc tttatcaggt tcgttagggt tttatgggtt tttgaaatta 1080 

aatactcaat catcttagtc tcattattct attggttgaa tcacattttc taatttggaa 1140 

tttatgagac aatgtatgtt ggacttagtt gaagttcttc tctttggtta tagttgaagt 1200 

gttactgatg ttgtttagct ctttacacca atatatacac ccaattttgc agaaatccga 1260 

gttctgcgtt gtgattcgag taaagttgtc gcaaaaccga agtttaggaa caatcttgtt 1320 

aggcctgatg gtcaaggatc ttcattgttg ttgtatccaa aacataagtc gagatttcgg 1380 

gttaatgcca ctgcgggtca gcctgaggct ttcgactcga atagcaaaca gaagtctttt 1440 

agagactcgt tagatgcgtt ttacaggttt tctaggcctc atacagttat tggcacagtt 1500 

aagtttctct ttaaaaatgt aactctttta aaacgcaatc tttcagggtt ttcaaggaga 1560 

taacattagc tctgtgattg gatttgcagg tgcttagcat tttatctgta tctttcttag 1620 

cagtagagaa ggtttctgat atatctcctt tacttttcac tggcatcttg gaggtaatga 1680 



40 



10 
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atatataaca cataatgacc gatgaagaag atacattttt ttcgtctctc tgtttaaaca 1740 

attgggtttt gttttcaggc tgttgttgca gctctcatga tgaacattta catagttggg 1800 

ctaaatcagt tgtctgatgt tgaaatagat aaggtaacat gcaaattttc ttcatatgag 1860 

ttcgagagac tgatgagatt aatagcagct agtgcctaga tcatctctat gtgggttttt 1920 

gcaggttaac aagccctatc ttccattggc atcaggagaa tattctgtta acaccggcat 1980 

tgcaatagta gcttccttct ccatcatggt atggtgccat tttcacaaaa tttcaacttt 2040 

tagaattcta taagttactg aaatagtttg ttataaatcg ttatagagtt tctggcttgg 2100 

gtggattgtt ggttcatggc cattgttctg ggctcttttt gtgagtttca tgctcggtac 2160 

tgcatactct atcaatgtaa gtaagtttct caatactaga atttggctca aatcaaaatc 2220 

tgcagtttct agttttaggt taatgaggtt ttaataactt acttctacta caaacagttg 2280 

ccacttttac ggtggaaaag atttgcattg gttgcagcaa tgtgtatcct cgctgtccga 2340 

gctattattg ttcaaatcgc cttttatcta catattcagg tactaaacca ttttccttat 2400 

gttttgtagt tgttttcatc aaaatcactt ttatattact aaagctgtga aactttgttg 2460 

cagacacatg tgtttggaag accaatcttg ttcactaggc ctcttatttt cgccactgcg 2520 

|5 tttatgagct ttttctctgt cgttattgca ttgtttaagg taaacaaaga tggaaaaaga 2580 

y ttaaatctat gtatacttaa agtaaagcat tctactgtta ttgatgagaa gttttctttt 2640 

^ ttggttggat gcaggatata cctgatatcg aaggggataa gatattcgga atccgatcat 2700 

Q tctctgtaac tctgggtcag aaacgggtac gatatctaaa ctaaagaaat tgttttgact 2760 

3 caagtgttgg attaagatta cagaagaaag aaaactgttt ttgtttcttg caaaattcag 2820 

gtgttttgga catgtgttac actacttcaa atggcttacg ctgttgcaat tctagttgga 2880 

gccacatctc cattcatatg gagcaaagtc atctcggtaa caatctttct ttacccatcg 2940 

aaaactcgct aattcatcgt ttgagtggta ctggtttcat tttgttccgt tctgttgatt 3000 

ttttttcagg ttgtgggtca tgttatactc gcaacaactt tgtgggctcg agctaagtcc 3060 

gttgatctga gtagcaaaac cgaaataact tcatgttata tgttcatatg gaaggttaga 3120 

ttcgtttata aatagagtct ttactgcctt tttatgcgct ccaatttgga attaaaatag 3180 

cctttcagtt tcatcgaatc accattatac tgataaattc tcatttctgc atcagctctt 3240 

ttatgcagag tacttgctgt tacctttttt gaagtgactg acattagaag agaagaagat 3300 

ggagataaaa gaataagtca tcactatgct tctgttttta ttacaagttc atgaaattag 3360 

gtagtgaact agtgaattag agttttattc tgaaacatgg cagactgcaa aaatatgtca 3420 

aagatatgaa tttctgttgg gtaaagaagt ctctgcttgg gcaaaatctt aaggttcggt 3480 

gtgttgatat aatgctaagc gaagaaatcg attctatgta gaaatttccg aaactatgtg 3540 

taaacatgtc agaacatctc cattctatat cttcttctgc aagaaagctc tgtttttatc 3600 

acctaaactc tttatctctg tgtagttaag atatgtatat gtacgtgact acattttttt 3660 

gttgatgtaa tttgcagaac gtatggattt ttgttagaaa gcatgagttc gaaagtatat 3720 

35 gtttatatat atggataatt cagacctaac gtcgaagctc acaagcataa attcactact 3780 

atagtttgct ctgtaataga tagttccatt gatgtcttga aactgtacgt aactgcctgg 3840 

gcgttttgtg gttgatactg actactgagt gttctttgtg agtgttgtaa gtatacaaga 3900 

agaagaatat aggctcacgg gaacgactgt ggrggaagat gaaatggaga tcatcacgta 3960 

gcggctttgc caaagaccga gtcacgatcg agtctatgaa gtctttacag ctgctgatta 4020 

tgattgacca ttgcttagag acgcattgga atcttactag ggacttgcct gggagtttct 4 080 

tcaagtacgt gtcagatcat acgatgtagg agatttcacg gctttgatgt gtttgtttgg 4140 



SO 
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40 



41 



Attorney Docket No: 



5 















52015/01/US 


agtcacaatg 


cttaatgggc 


ttattggccc 


aataatagct 


agctcttttg 


ctttagccgt 


4200 


ttcgtttgtc 


ccctggtggt 


gagtattatt 


agggtatggt 


gtgaccaaag 


tcaccagacc 


4260 


tagagtgaat 


ctagtagagt 


cctagaccat 


ggtccatggc 


ttttatttgt 


aatttgaaaa 


4320 


atgaacaatt 


ctttttgtaa 


ggaaaacttt 


tatatagtag 


acatt t act a 


ta1"acf3^^p1" 

* — *->L V*i t^J. Oi- ^ 


^ JO u 


agttgaacta 


acttcgtgca 


attgcataat 


aatggtgtga 


aatagagggt 


gcaaaactca 


4440 


ataaacattt 


cgacgtacca 


agagttcgaa 


acaataagca 


aaatagattt 


ttttgcttca 


4500 


gactaarttg 


tacaatgaat 


ggttaataaa 


ccattgaagc 


ttttattaat 




4550 



10 



<210> 92 
<211> 4450 
<212> DNA 

<213> Arabiclopsis sp 



45 



85 



30 



35 



40 



<400> 92 

tttaggttac 

cttgtttgac 

aaagacatgg 

cactaatcta 

cctgaaggag 

aaaagt ctac 

ttgacagaga 

agcaccaaac 

caaggcaaat 

ctttctactg 

tcctgatgca 

aaaaagaatc 

attgagtacc 

caaatccagt 

caacttttac 

aaaaaaatga 

gaaattttgg 

aactttcttt 

taaagtatag 

gacaaaagtc 

agaaatctgt 

gattaactac 

gaatatacct 

tagagaggga 

gctccccagt 

atcataagaa 

ttacccaaaa 



aaaatcaatg atattgcgta 
cagaaggtca tgatcattgt 



tgtcaactat aaaagccaaa 
atacatacag ccaaactacc 



atcccaaaca 

aaagagttaa 

acccactttg 

ttcaattctt 

gagagtcttt 

cacttgttcg 

cacataattg 

cagtcagcac 

gcggccagtg 

aaaagacagt 

gagatctgca 

taaccaaagc 

atcatcttct 

tttaacctag 

gttcgtagct 

tctcacttct 

aaatcagatg 

gaacatcgaa 

ggtggtgaag 

tttgctactg 

gatgtgcata 

gtacaataga 

ttatggtcaa 

aatcagaaaa 

tgtaaacctc 



acaacaatag 

gtttcagctt 

tagcaagacc 

catatatagg 

attgaaaact 

acacaaatct 

gattgtgaaa 

cagatgataa 

atgcgtaata 

aaatggaatt 

ctgaatccag 

tttgtattat 

ttgtcctgga 

aatatctcaa 

tgtggcatat 

gttgcaaacg 

gaaaaggtgg 

atggatgcat 

ctagaaaaag 

gtcataatca 

aatagtatca 

tggtgctatg 

acctaaaaag 

tatataatgt 

ttcataagtg 



cttcttttac 
ttctggcaat 
atgtcctctg 
tt cct cacac 
tcttccaagt 
gtacagatat 
gagtacaaaa 
gtcagctgtc 
ttgccaccct 
aggaatcaca 



aagaaccagt 
ggctccttga 
tttcacttac 
tacagcttca 



agtaaagcct 
tcctggaaga 
agtaactagt 
tcatttcaat 
agtgtgtctc 



tcctcattcg 
acaactccac taaatataat 
aaaaacacta ttaggttttc 
gataaaccca aattttcata 
cctatttgcc atcctaactg 
taatcattag agcgagaaac 
aatgagtcct tgtaaagttt 



aaagtgcaag aaaacctatg gatgctgtgc 



caccgaatct 
gacacaatat 
aattacttgc 
actatttcat 
ggaagacttt 
gagatcaggg 
ttgcatgaga 
aaaacaaagc 
aatagatttt 
taaacaaggg 
cttcctttaa 
gcttgaggct 
ctaactttga 
ggtaggaaaa 



aagggctgtt 
attagacatt 
ataaaaactg 
tttcaatggg 
tatggggcta 
taattttctt 
catgaaacaa 
aagcaatatg 
gaagctaaaa 
tccagcagac 
ctgcagtcca 
gcaattataa 
gaagccagaa 
gacaagtaac 



gacttaacac 
agtccatgga 
aacttgagct 
ccacaaaggt 
actcttcact 
ctttatgatt 
aagctgaaaa 
cacacattga 
aataaaaagt 
tccggagaga 
tcctaacaat 
aaacgaatca 
tagatttaaa 
aaagatgaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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cccctaaaac 


aeggctgeag 


aatatacata 


ctgaaatgag 


ctcaaqtaqa 


aaaqaatttq 


1680 




atcacaaaac 


taaagacaag 


acctgagaac 


atat ctt cag 


aatttqqqcc 

—j -j 


aactacataa 


1740 




gggtgaacca 


tatgtgtatg 


tgaattttta 


aacaaacact 


tgcaaatacq 


cgactttagg 


1800 




gcaagtaaaa 


aatccaaaca 


aacctgtaat 


tgttaagttg 


gagaaqaatc 

— ' _F* _J 


cctaagccta 


1860 


5 


aaagcaactg 


cagcccgaga 


aatccaatcc 


cttgaaatqq 


tgt caaaaga 


cca ct aacaa 


J- ^ ^1* \J 




taggtcttag 


ttttgtacga 


tcaacctgga 


tataaaagaa 


attt gtaaga 


ca.aca.taai" c 






t aaaacaaaa 


caaccataca 


aaatcttgag 


ctttacatac 


aagcaaccca 


tctttqttta 


2040 




t ggaagaatg 


aatccagtta 


catgaatget 


gtgtatctac 


cctaactact 


aaacacatat 


2100 




t tcaatcgaa 


aaacatattc 


caccttcacc 


atat ctaaca 


cctgaaqt ct 


ttcacttttt 


2160 


10 


gaacgaagtc 


atcagaacat 


gcagataagc 


tattacccaa 


aacagaqata 


tqact craaaa 


2220 

{Li £—i \J 




tgttgtcgta 


aattgatcca 


acatagaaaa 


atcaagacca 


gttccagatq 


t caaagcaat 


2280 




aacactttcc 


caccatggtt 


acagaaacca 


tagttacaca 


aaacatqttt 


cctaaaccaa 


2340 




catactaaag 


ggatatataa 


atttgacatc 


actttatcac 


cataccataa 


qat aqct taa 


2400 




aaacaaactg 


acctttgtat 


ctatgtcctg 


atcaagcaqa 


tcatttataq 

-> 


tacaaccaac 


24 60 


,45 


acctctaaga 


agtaatgetc 


cgcaaccaaa 


taaagecata 


tatttaaaac 


t t ggaaggct 


2520 


* - *1 


tccaggatca 


gcagccaacg 


caatcgacct 


atacaacaat 


gatqqaqatt 


caoaatatca 


2 S8 0 


- F fJlUM 
■ l- «• 


atctatttac 


atagctctgg 


aactagatcc 


atgacqaaac 


atggaacat c 


qtt ataatat 


264 0 


I" I ' 

4 to U 
1 1 


ctaaagactt 


ccaaacagat 


tcctgagtaa 


gaaacccaqt 


ggaactataq 


t act ataaca 


2700 




tatataaaat 


caaagaaaac 


tcaggtttat 


agcattatcc 


aatcctgatt 


t ct accaat c 


27 (SO 


lie 

ffV**I 


cttaaccact 


ctcccatgct 


atcaaaaacc 


tcagctcaaq 


atcatactac 


ctaatt acct 


2820 




atgagctctt 


gggaagatca 


ttatggattt 


gataactqaa 


aaaagtaaca 


cfaaaaataac 


W U \J 


F- "l» 


agactgeaag 


aactactcca 


aactt ctcca 


ctgatatqta 


tgtagtctaa 


caataat aaa 


2 940 


4 +HFF* 
■ 1 

V 4i 

4. . null 


cagacataaa 


ttcttttatc 


aagcttcaag 


agcaagttaq 

— — -j 


tcagaaaaca 


t cacagccaa 


3000 




accaaccagg 


aaaacacata 


actttatcac 


ataaaactaa 


atttaatqta 

—> 


atctqactta 


3060 


-25 

■ -M — t 
F p 


acataaacca 


tcctttggga 


cgaaaggaaa 


ctatataaac 


atqcaqt ctt 


tctttpprtr 1 


J- t£. \J 


' F 
F 


agctattctt 


teggatggat 


tataatgaat 


ctcaaaaqtq 


aaat gtct tg 


a1"1~Pi"p^nT , i _ 

* — L k — \_s ^ oi t_ 


JlOU 




acattactca 


aaggegaaga 


taaacttacc 


acatacaaqq 


ccacgcaaqc 


aaccaaatt c 


324 0 




caatgggttt 


at ccaatcga 


gcaagcttag 


cataacctct 


aacttcttct 


CfCftaaataca 


3300 

vy \y 




aatctatcca 


agaagcttcc 


ttaacaacaa 


caccatcact 


cttctcctta 


tcatctttct 


3360 

V-/ -_p^ w v_/ 


30 


teggcttt cc 


ctccaaaacc 


gaagaagacg 


acgacattcc 


acaaattaat 


ct qt aattcc 


3^20 




aaccaacacc 


aaaaaacttc 


tectgatgea 


attctcttcc 


tttact ccat 


act taataat 


3480 




tatcattcca 


tgaaggataa 


cacttagtga 


aaggatttgt 


gtaatgqqta 


qtcacaacrat 


3540 




tggacaagga 


tttatgttgt 


gattgeaaaa 


gagcagagqa 


agaagatgga 


Q"t taccraacra 

~1 ^— V— V-A Vjj yJ Vwij 


3600 

^> \J \J KJ 


35 


eggaagattt 


caacaaccgt 


cttgaaacac 


gggagaqece 


aaaaaacqcc 


at cttt aaaa 


3660 

J l 1 U u 


gaaattgttg 


cctggaagaa 


acaaagactt 


gagatttcaa 


acgtaagtga 


att c 1 1 accra 


3720 




acgaaagcta 


acttctcaag 


agaatcagat 


tagtqattcc 


t caaaaacaa 


acaaaact at 


3780 




ct aatttcag 


tttcgagtga 


tgaagectta 


agaatctaqa 


acctccatqq 


cat tt ctaat 


38 4 0 

•J U T U 




ct ctcagaga 


taatcgaatt 


ccttaaacaa 


t caaagctta 


o a a a Q a cr a a n 


ClCL^CLCl^Cl.ClOCl 




40 


acaacaaaaa 


aaatcagatt 


aacaaccgac 


cagagagcaa 


cgacgacgcc 


ggcgagaaag 


3960 


ageaegtegt 


cteggagcaa 


gacttcttct 


ccagtaaccc 


ggatggatcg 


ttaatgggcc 


4020 




tgtagattat 


tatatttggg 


ccgaaacaat 


tgggtcagca 


aaaacttggg 


ggataatgaa 


4080 
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gaaacacgta cagtatgcat ttaggctcca aattaattgg ccatataatt cgaatcagat 4140 
aaactaatca acccctacct tacttatttc tcactgtttt tatttctacc ttagtagttg 4200 
aagaaacact tttatttatc ttttcgggac ccaaatttga taggatcggg ccattactca 4260 
tgagcgtcag acacatatta gccttatcag attagtgggg taaggttttt ttaattcggt 4320 
aagaagcaac aatcaatgtc ggagaaatta aagaatctgc atgggcgtgg cgtgatgata 4380 
tgtgcatatg gagtcagttg ccgatcatat ataactattt ataaactaca tataaagact 
actaatagat 



<210> 93 
<211> 2850 
<212> DNA 

<213> Arabidopsis sp 



4440 
4450 



60 



240 
300 
360 
420 
480 



<400> 93 

aattaaaatt tgagcggtct aaaccattag accgtttaga gatccctcca acccaaaata 
gtcgattttc acgtcttgaa catatattgg gccttaatct gtgtggttag taaagacttt 120 
tattggtcaa agaaaaacaa ccatggccca acatgttgat acttttattt aattatacaa 180 
gtacccctga attctctgaa atatatttga ttgacccaga tattaatttt aattatcatt 
tcctgtaaaa gtgaaggagt caccgtgact cgtcgtaatc tgaaaccaat ctgttcatat 
gatgaagaag tttctctcgt tctcctccaa cgcgtagaaa attctgacgg cttaacgatg 
tggcgaagat ctgttgttta tcgtttctct tcaagaatct ctgtttcttc ttcgttacca 
aaccctagac tgattccttg gtcccgcgaa ttatgtgccg ttaatagctt ctcccagcct 
ccggtctcga cggaatcaac tgctaagtta gggatcactg gtgttagatc tgatgccaat 540 
cgagtttttg ccactgctac tgccgccgct acagctacag ctaccaccgg tgagatttcg 600 
tctagagttg cggctttggc tggattaggg catcactacg ctcgttgtta ttgggagctt 660 
tctaaagcta aacttaggta tgtgtttact tttcttttct catgaaaaat ctgaaaattt 720 
ccaattgttg gattcttaaa ttctcatttg ttttatggtt gtagtatgct tgtggttgca 780 
acttctggaa ctgggtatat tctgggtacg ggaaatgctg caattagctt cccggggctt 
tgttacacat gtgcaggaac catgatgatt gctgcatctg ctaattcctt gaatcaggtc 
attgaaatgt tgagaagttc ataaatttcg aatccttgtt gtgtttatgt agttgatctt 
gcttgcttat gtttatgtag ttgaaaagtt taaaaatttc taatccttgg tagttgatct 
cgcttgtttg ttttttcatt ttctagattt ttgagataag caatgattct aagatgaaaa 
gaacgatgct aaggccattg ccttcaggac gtattagtgt tccacacgct gttgcatggg 1140 
ctactattgc tggtgcttct ggtgcttgtt tgttggccag caaggtgaat gtttgttttt 1200 
ttatatgtga tttctttgtt ttatgaatgg gtgattgaga gattatggat ctaaactttt 1260 
gcttccacga caaggttatt gcagactaat atgttggctg ctggacttgc atctgccaat 1320 
cttgtacttt atgcgtttgt ttatactccg ttgaagcaac ttcaccctat caatacatgg 1380 
gttggcgctg ttgttggtgc tatcccaccc ttgcttgggt aaatttttgt tccttttctt 1440 
ctttatttta gcagattctg ttttgttgga tactgctttt aattcaaaat gtagtcatgg 



840 
900 
960 
1020 
1080 



1500 



ttcaccaatt ctatgcttat ctattttgtg tgttgtcagg tgggcggcag cgtctggtca 1560 
gatttcatac aattcgatga ttcttccagc tgctctttac ttttggcaga tacctcattt 1620 
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tatggccctt gcacatctct gccgcaatga ttatgcagct ggagggtaag accatatggt 1680 

gtcatatgag attagaatgt ctccttccat gtagtgttga tcttgaacta gttcaatttc 1740 

gtggaatgat cagagtgtcc tagatagtgt cacagcagtc gacattttag tggctagata 1800 

atgagttctt tccgttagag ataaacattc gcgaacattg tttccagctt ccgcgaccca 1860 

acttctgatt ttgtttcttg gtaccttgtt ttcagttaca agatgttgtc actctttgat 1920 

ccgtcaggga agagaatagc agcagtggct ctaaggaact gcttttacat gatccctctc 1980 

ggtttcatcg cctatgactg tgagtcttgt agattcatct tttttttgta gtttattgac 2040 

tgcattgctg tatctgattt ttgctgttcc ttccaatttt tgtgacaggg gggttaacct 2100 

caagttggtt ttgcctcgaa tcaacacttc tcacactagc aatcgctgca acagcatttt 2160 

cattctaccg agaccggacc atgcataaag caaggaaaat gttccatgcc agtcttctct 2220 

tccttcctgt tttcatgtct ggtcttcttc tacaccgtgt ctctaatgat aatcagcaac 2280 

aactcgtaga agaagccgga ttaacaaatt ctgtatctgg tgaagtcaaa actcagaggc 2340 

gaaagaaacg tgtggctcaa cctccggtgg cttatgcctc tgctgcaccg tttcctttcc 2400 

tcccagctcc ttccttctac tctccatgat aacctttaag caagctattg aatttttgga 2460 

aacagaaatt aaaaaaaaaa tctgaaaagt tcttaagttt aatctttggt taataatgaa 2520 

gtggagaacg catacaagtt tatgtatttt ttctcatctc cacataattg tattttttct 2580 

ctaagtatgt ttcaaatgat acaaaataca tactttatca attatctgat caaattgatg 2640 

aatttttgag ctttgacgtg ttaggtctat ctaataaacg tagtaacgaa tttggttttg 2700 

gaaatgaaat ccgataaccg atgatggtgt agagttaaac gattaaaccg ggttggttaa 2760 

aggtctcgag tctcgacggc tgcggaaatc ggaaaatcac gattgaggac tttgagctgc 2820 

cacgaagatg gcgatgaggt tgaaatcaat 2850 

<210> 94 
<211> 3660 
<212> DNA 

<213> Arabidopsis sp 
<400> 94 

tatttgtatt tttattgtta aattttatga tttcacccgg tatatatcat cccatattaa 60 

tattagattt attttttggg ctttatttgg gttttcgatt taaactgggc ccattctgct 120 

tcaatgaaac cctaatgggt tttgtttggg ctttggattt aaaccgggcc cattctgctt 180 

caatgaaggt cctttgtcca acaaaactaa catccgacac aactagtatt gccaagagga 240 

tcgtgccaca tggcagttat tgaatcaaag gccgccaaaa ctgtaacgta gacattactt 300 

atctccggta acggacaacc actcgtttcc cgaaacagca actcacagac tcacaccact 360 

ccagtctccg gcttaactac caccagagac gattctctct tccgtcggtt ctatgacttc 420 

gattctcaac actgtctcca ccatccactc ttccagagtt acctccgtcg atcgagtcgg 4 80 

agtcctctct cttcggaatt cggattccgt tgagttcact cgccggcgtt ctggtttctc 540 

gacgttgatc tacgaatcac ccggtagtta gcattctgtt ggatagattg atgaatgttt 600 

tcttcgattt tttttttact gatcttgttg tggatctctc gtagggcgga gatttgttgt 660 

gcgtgcggcg gagactgata ctgataaagg tatgattttt tagttgtttt tattttctct 720 

ctcttcaaaa ttctcttttc aaacactgtg gcgtttgaat ttccgacggc agttaaatct 780 
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cagacacctg acaaggcacc agccggtggt tcaagcatta accagcttct cggtatcaaa 840 

ggagcatctc aagaaactgt aattttgttc atctcctcag aatcttttaa attatcatat 900 

ttgtggataa tgatgtgtta gtttaggaat tttcctacta aaggtaatct cttttgagga 960 

caagtcttgt ttttagctta gaaatgatgt gaaaatgttg tttgttagct aaaaagagtt 1020 

tgttgttata ttctgtattc agaataaatg gaagattcgt cttcagctta caaaaccagt 1080 

cacttggcct ccactggttt ggggagtcgt ctgtggtgct gctgcttcag gtaatcatac 1140 

gaacctcttt tggatcatgc aatactgtac agaaagtttt ttcattttcc ttccaattgt 1200 

ttcttctggc agggaacttt cattggaccc cagaggatgt tgctaagtcg attctttgca 1260 

tgatgatgtc tggtccttgt cttactggct atacacaggt ctggttttac acaacaaaaa 1320 

gctgacttgt tcttattcta gtgcatttgc ttggtgctac aataacctag acttgtcgat 1380 

ttccagacaa tcaacgactg gtatgataga gatatcgacg caattaatga gccatatcgt 1440 

ccaattccat ctggagcaat atcagagcca gaggtaactg agacagaaca ttgtgagctt 1500 

ttatctcttt tgtgattctg atttctcctt actccttaaa atgcaggtta ttacacaagt 1560 

ctgggtgcta ttattgggag gtcttggtat tgctggaata ttagatgtgt gggtaagttg 1620 

gcccttctga cattaactag tacagttaaa gggcacatca gatttgctaa aatcttccct 1680 

tatcaggcag ggcataccac tcccactgtc ttctatcttg ctttgggagg atcattgcta 1740 

tcttatatat actctgctcc acctcttaag gtaagtttta ttcctaactt ccactctcta 1800 

gtgataagac actccatcca agttttggag ttttgaatat cgatatctga actgatctca 1860 

ttgcagctaa aacaaaatgg atgggttgga aattttgcac ttggagcaag ctatattagt 1920 

ttgccatggt aagatatctc gtgtatcaat aatatatggc gttgttctca tctcattgat 1980 

ttgtttcttg ctcacttgac tgataggtgg gctggccaag cattgtttgg cactcttacg 2040 

ccagatgttg ttgttctaac actcttgtac agcatagctg gggtactctt ttggcaaacc 2100 

ttttatgttg cttttttcgt tatctgttgt aatatgctct tgcttcatgt tgtacctttg 2160 

tgataatgca gttaggaata gccattgtta acgacttcaa aagtgttgaa ggagatagag 2220 

cattaggact tcagtctctc ccagtagctt ttggcaccga aactgcaaaa tggatatgcg 2280 

ttggtgctat agacattact cagctttctg ttgccggtat gtactatcca ctgtttttgt 2340 

gcagctgtgg cttctatttc ttttccttga tcttatcaac tggatattca ccaatggtaa 2400 

agcacaaatt aatgaagctg aatcaacaaa ggcaaaacat aaaagtacat tctaatgaaa 2460 

tgagctaatg aagaggaggc atctactttt atgtttcatt agtgtgattg atggattttc 2520 

atttcatgct tctaaaacaa gtattttcaa cagtgtcatg aaataacaga acttatatct 2580 

tcatttgtac ttttactagt ggatgagtta cacaatcatt gttatagaac caaatcaaag 2640 

gtagagatca tcattagtat atgtctattt tggttgcagg atatctatta gcatctggga 2700 

aaccttatta tgcgttggcg ttggttgctt tgatcattcc tcagattgtg ttccaggtaa 2760 

agacgttaac agtctcacat tataattaat caaattcttg tcactcgtct gattgctaca 2820 

ctcgcttcta taaactgcag tttaaatact ttctcaagga ccctgtcaaa tacgacgtca 2880 

agtaccaggt aagtcaactt agtacacatg tttgtgttct tttgaaatat ctttgagagg 2940 

tctcttaatc agaagttgct tgaaacactc atcttgatta caggcaagcg cgcagccatt 3000 

cttggtgctc ggaatatttg taacggcatt agcatcgcaa cactgaaaaa ggcgtatttt 3060 

gatggggttt tgtcgaaagc agaggtgttg acacatcaaa tgtgggcaag tgatggcatc 3120 

aactagttta aaagattttg taaaatgtat gtaccgttat tactagaaac aactcctgtt 3180 

gtatcaattt agcaaaacgg ctgagaaatt gtaattgatg ttaccgtatt tgcgctccat 3240 
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ttttgcattt 


cctgctcata 


tcgaggattg 


gggtttatgt 


tagttctgtc 


acttctctgc 


3300 


tttcagaatg 


tttttgtttt 


ctgtagtgga 


ttttaactat 


tttcatcact 


ttttgtattg 


3360 


attctaaaca 


tgtatccaca 


taaaaacagt 


aatatacaaa 


aatgatactt 


cctcaaactt 


3420 


tttataatct 


aaat ctaaca 


actagctagt 


aacccaact a 


acttcataca 


attaatttga 


3480 


gaaactacaa 


agactagact 


atacatatgt 


tatttaacaa 


cttgaaactg 


tgttattact 


3540 


acctgatttt 


tttctattct 


acagccattt 


gatatgctgc 


aatcttaaca 


tatcaagtct 


3600 


cacgttgttg 


gacacaacat 


actat cacaa 


gtaagacacg 


aagtaaaacc 


aaccggcaac 


3660 



<210> 95 
10 <211> 

<212> DNA 

<213> soy 



ATGGATTCACTGCTTCTTCGATCTTTCCCTAATATTAATAACGCCTCTTCTCTCACCACCACTGGTGCAAATTTCTCC 
.15 AGGACTAAATCTTTCGCCAACATTTACCATGCAAGTTCTTATGTGCCAAATGCTTCATGGCACAATAGGAAAATCCAA 
; -f AAAGAATATAATTTTTTGAGGTTTCGGTGGCCAAGTTTGAACCATCATTACAAAGGCATTGAGGGAGCGTGTACATGT 
; 4i AAAAAATGTAATATAAAATTTGTTGTGAAAGCGACCTCTGAAAAATCTCTTGAGTCTGAACCTCAAGCTTTTGATCCA 
ill AAAAGCATTTTGGACTCTGTCAAGAATTCCTTGGATGCTTTCTACAGGTTTTCCAGGCCTCACACAGTTATTGGCACA 
|;fl GCATTAAGCATAATTTCTGTGTCTCTTCTTGCTGTTGAGAAAATATCAGATATATCTCCATTATTTTTTACTGGTGTG 
r 30 TTGGAGGCTGTGGTTGCTGCCCTGTTTATGAATATTTATATTGTTGGTTTGAATCAATTGTCTGATGTTGAAATAGAC 

AAGATAAACAAGCCGTATCTTCCATTAGCATCTGGGGAATATTCCTTTGAAACTGGTGTCACTATTGTTGCATCTTTT 
:;::f TCAATTCTGAGTTTTTGGCTTGGCTGGGTTGTAGGTTCATGGCCATTATTTTGGGCCCTTTTTGTAAGCTTTGTGCTA 
KM GGAACTGCTTATTCAATCAATGTGCCTCTGTTGAGATGGAAGAGGTTTGCAGTGCTTGCAGCGATGTGCATTCTAGCT 
KU GTTCGGGCAGTAATAGTTCAACTTGCATTTTTCCTTCACATGCAGACTCATGTGTACAAGAGGCCACCTGTCTTTTCA 
:25 AGACCATTGATTTTTGCTACTGCATTCATGAGCTTCTTCTCTGTAGTTATAGCACTGTTTAAGGATATACCTGACATT 
i>4 GAAGGAGATAAAGTATTTGGCATCCAATCTTTTTCAGTGCGTTTAGGTCAGAAGCCGGTGTTCTGGACTTGTGTTACC 

CTTCTTGAAATAGCTTATGGAGTCGCCCTCCTGGTGGGAGCTGCATCTCCTTGTCTTTGGAGCAAAATTTTCACGGGT 

CTGGGACACGCTGTGCTGGCTTCAATTCTCTGGTTTCATGCCAAATCTGTAGATTTGAAAAGCAAAGCTTCGATAACA 

! :sSs TCCTTCTATATGTTTATTTGGAAGCTATTTTATGCAGAATACTTACTCATTCCTTTTGTTAGATGA 
,10 

- » M«W 

i3[ <21C> 96 

<211> 

<212> DNA 
35 <213> soy 



ATGGATTCGATGCTTCTTCGATCTTTTCCTAATATTAACAACGCTTCTTCTCTCGCCACCACTGGTTCTTATTTGCCA 
AATGCTTCATGGCACAATAGGAAAATCCAAAAAGAATATAATTTTTTGAGGTTTCGGTGGCCAAGTTTGAACCACCAT 
TACAAAAGCATTGAAGGAGGGTGTACATGTAAAAAATGTAATATAAAATTTGTTGTGAAAGCGACCTCTGAAAAATCT 

4 0 TTTGAGTCTGAACCCCAAGCTTTTGATCCAAAAAGCATTTTGGACTCTGTCAAGAATTCCTTGGATGCTTTCTACAGG 
TTTTCCAGACCTCACACAGTTATTGGCACAGCATTAAGCATAATTTCTGTGTCCCTCCTTGCTGTTGAGAAAATATCA 
GATATATCTCCATTATTTTTTACTGGTGTGTTGGAGGCTGTGGTTGCTGCCCTGTTTATGAATATTTATATTGTTGGT 
TTGAATCAATTGTCTGATGTTGAAATAGACAAGATAAACAAGCCGTATCTTCCATTAGCATCTGGGGAATATTCCTTT 
GAAACTGGTGTCACTATTGTTGCATCTTTTTCAATTCTGAGTTTTTGGCTTGGCTGGGTTGTAGGTTCATGGCCATTA 

4 5 TTTTGGGCCCTTTTTGTAAGCTTTGTGCTAGGAACTGCTTATTCAATCAATGTGCCTCTGTTGAGATGGAAGAGGTTT 
GCAGTGCTTGCAGCGATGTGCATTCTAGCTGTTCGGGCAGTAATAGTTCAACTTGCATTTTTCCTTCACATCCAGACT 
CATGTATACAAGAGGCCACCTGTCTTTTCAAGATCATTGATTTTTGCTACTGCATTCATGAGCTTCTTCTCTGTAGTT 
ATAGCACTGTTTAAGGATATACCTGACATTGAAGGAGATAAAGTATTTGGCATCCAATCTTTTTCAGTGCGTTTAGGT 
CAGAAGCCGGTATTCTGGACTTGTGTTATCCTTCTTGAAATAGCTTATGGAGTCGCCCTCCTGGTGGGAGCTGCATCT 

50 CCTTGTCTTTGGAGCAAAATTGTCACGGGTCTGGGACACGCTGTTCTGGCTTCAATTCTCTGGTTTCATGCCAAATCT 

GTAGATTTGAAAAGCAAAGCTTCGATAACATCCTTCTATATGTTTATTTGGAAGCTATTTTATGCAGAATACTTACTC 
ATTCCTTTTGTTAGATGA 
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<210> 97 
<211> 
<212> PRT 
<213> soy 



MDSMLLRSFPNINNASSLATTGSYLPNASWHNRKIQKEYNFLRFRWPSLNHHYKSIEGGCTCKKCNIKFVVKATSEKS 

FESEPQAFDPKSILDSVKNSLDAFYRFSRPHTVIGTALSIISVSLLAVEKISDISPLFFTGVLEAVVAALFMNIYIVG 

LNQLSDVEIDKINKPYLPLASGEYSFETGVTIVASFSILSFWLGWVVGSWPLFWALFVSFVLGTAYSINVPLLRWKRF 

AVLAAMCILAVRAVIVQLAFFLHIQTHVYKRPPVFSRSLIFATAFMSFFSVVIALFKDIPDIEGDKVFGIQSFSVRLG 

QKPVFWTCVILLEIAYGVALLVGAASPCLWSKIVTGLGHAVLASILWFHAKSVDLKSKASITSFYMFIWKLFYAEYLL 
IPFVR 

<210> 98 
<211> 
<212> PRT 
<213> soy 



MDSLLLRSFPNINNASSLTTTGANFSRTKSFANIYHASSYVPNASWHNRKIQKEYNFLRFRWPSLNHHYKGIEGACTC 

KKCNIKFVVKATSEKSLESEPQAFDPKSILDSVKNSLDAFYRFSRPHTVIGTALSIISVSLLAVEKISDISPLFFTGV 

LEAVVAALFMNI YIVGLNQLSDVEIDKINKPYLPLASGEYSFETGVTIVASFSILSFWLGWVVGSWPLFWAL FVSFVL 

GTAYSINVPLLRWKRFAVLAAMCILAVRAVIVQLAFFLHMQTHVYKRPPVFSRPLIFATAFMSFFSVVIALFKDIPDI 

EGDKVFGIQSFSVRLGQKPVFWTCVTLLEIAYGVALLVGAASPCLWSKIFTGLGHAVLASILWFHAKSVDLKSKASIT 
SFYMFIWKLFYAEYLL IPFVR 

<210> 99 
<211> 
<212> DNA 
<213> rice 



GAGCAGCACTGGGTCTTACATTCCAATGGAGCTCGCCTGTTGCTTTCATTACATGCTTCG 

TGACTTTATTTGCTTTGGTCATTGCTATAACCAAAGATCTCCCAGATGTTGAAGGGGATC 

GGAAGTATCAAATATCAACTTTGGCGACAAAGCTCGGTGTCAGAAACATTGCATTTCTTG 

GCTCTGGTTTATTGATAGCAAATTATGTTGCTGCTATTGCTGTAGCTTTTCTCATGCCTC 

AGGCTTTCAGGCGCACTGTAATGGTGCCTGTGCATGCTGCCCTTGCCGTTGGTATAATTT 

TCCAGACATGGGTTCTGGAGCAAGCAAAATATACTAAGGATGCTATTTCACAGTACTACC 

GGTTCATTTGGAATCTCTTCTATGCTGAATACATCTTCTTCCCGTTGATATAGAGACCAA 

GCAATCTGATATGGTCTGCATGTTGAGTGCGGCAAAAACTAGAAGCCCATATGAACAGTG 

GGAGTAGGGGAACGAACATGCCATCCATGGGAAGACTCTGATAACTCTCTCTCGCCCGGG 

CTGTAAAGGGTAAGCACTGTTGGGCATATATATGAAAGGAAGGTGATAAAGCAGGGATGC 

TAAATTGCTACTGGGATCCTCAAAGGCTTATAGTGGTCACCAGTGGAATGTGCCTTAATA 

ATTTGGTTACCCAGCAGAGCAAGTTTTTGCAGGTTATTAGGTAATATCTTTGAGGGAATG 

AACTTAGATTTCATTGTTTTAAGGTCTGGTCACACAACGGGTAGTAGTGCTGGAGCGGCA 

AAAAACGACCTTGTTTTACACTACCAAGGGAGGTTAACTCTAGTTTTCATGTGACCACTT 

ACCTTGAGAGTTGAGACCATGGAATCACTTGTCGACTCCTCGGCTTGTATATTTCTAGTG 

TCAGCATTTGCATTCTCCTCCCCACTTGTACTTGAAAAGTTGAAGACAACTTTTTTGTTT 
GTGT 

<210> 100 
<211> 
<212> DNA 
<213> wheat 



CGTCCGCGGACGCGTGGGTGCTTATTCAGTCAATCTGCCGCACTTTCTATGGAAGAGATC 
TGCTGTTGTTGCAGCACTCTGCATATTAGCAGTGCGTGCGGTGATAGTTCAACTGGCATT 
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TTTTCTCCACATTCAGACATTTGTTTTCAGAAGGCCGGCAGACTTTTCAAAGCCATTGAT 
ATTTGCAACTGCCTTCATGACATTCTTCTCAGTTGTAATAGCATTATTCAAGGATATACC 
CGATATTGAAGGGGACCGCATCTTTGGAATCCAATCTTTTAGTGGTAGACTAGGTCAAAG 
CAGGGGTTTCTGGACTTGCGTTGGCCTACTTGAGGTTGCCTACGGTGTTGCGATACTGAG 
5 GGGGGTAACTTCTTCCAGTTTGTGGAGCAAATCTATAACTGTTGTGGGCCATGCAATCCT 
C 



• <210> 101 

10 <211> 

<212> DNA 

<213> leek 



GTTTCCCCCCCTCGAATTTTTTTTTTTTTTTTTTACTTCATTTTTCTGTGAATAAATTCT 
1 5 T AAAAAAG ACAAAGAAAAC CAC T GGAT AT C C TAAAT T C AAC AT AG GCTATTGT CAT T CAA 
T GATAAT CTT T AAC AC AAC AT AC AAC AT GAATATAAT TAAG GAGAAAT GAT C T GCAAT T G 
TTGAAAGAACTCTCCGTTTTTAAGATGACAATTAAAGCGTTGTTAATTCCAGCCATTTCT 
GCCTCCATTATCTACTCATCTTCTCTTGCGATTCTTTTCCATGTAGGTCATAAACCCTCA 
T C T TAC AAAAGGAAT GAGCAAGT AC T CAGCAT AGAAGAGCT T CC ACACGAAC ATAT AAAA 
20 AG AT G T AAT AG T GG T T T T GG T CAT T GG T CC AT GAGAT C T AGC AC GAT T C C AAAG T AAC GA 

3 CCCAAGAATTGCATGACCTATCACTGTTAAGCATTTGCTCCATAGGCATGAGGAAGTAGC 
II T CCAACAAC CAT GAC AACAGT GT AGGCCAT C T CAAGGAGAT AT AT AC ATAT C CAAAAC AC 

11 CCTCTCCTGGCCAAGGCGCACGCTGAAAGAATGGATGCCAAATATTTTGTCTCCGTCTAT 
^ AT CAGGT ATAT CCT TAAAT AGAGC AAT AAC AAC TGAGAAGAAGCT CAT GAAGGCAGT T GC 

2 5 AAATATCAATGGCCTTGTGAAACTTGCTGGTCTTTTGAAAACAAA 

3 <210> 102 



" : L" <211> 

; 7 <212> DNA 

KS0 <213> leek 



NATTCGGCACGAGTTTTGAAGAAGTTAAGCATGGACTCCCTCCTTACCAAGCCAGTTGTA 
; Q ATACCTCTGCCTTCTCCAGTTTGTTCACTACCAATCTTGCGAGGCAGTTCTGCACCAGGG 
^ CAGTATTCATGTAGAAACTACAATCCAATAAGAATTCAAAGGTGCCTCGTAAATTATGAA 
%5 CAT G T GAAAC CAAGG T T T ACAAC AT G TAG T AG GT C T C AAAAAC T T GG T CAT G T AAAAG C C 

^ ACATCCGAGCATTCTTTAGAATCTGGATCCGAAGGATACACTCCTAGAAGCATATGGGAA 
GCCGTACTAGCTTCACTGAATGTTCTATACAAATTTTCACGACCTCACACAATAATAGGA 
ACAGCAATGGGCATAATGTCAGTTTCTTTGCTTGTTGTCGAGAGCCTATCCGATATTTCT 
CCTCTGTTTTTTGTGGGATTATTAGAGGCTGTGGTTGCTGCATTGTTTATGAATGTTTAC 

40 ATTGTAGGTCT GAAT C AAT TATTT G AC AT AGAAAT AGAC AAG G T C AAT AAAC C T GAT CTT 

CCTCTTGCATCTGGAGAATACTCACCAAGAGCTGGTACTGCTATTGTCATTGCTTCAGCC 
ATCATGAGCTTTGGCATTGGATGGTTAGTTGGCTCTT 

<210> 103 
45 <211> 

<212> DNA 

<213> canola 



TTTTTTTTTTTTTTTTTCAAAAAGACCAATCCTTTAGTATGTACAT 
50 GAACAAAGTGATTTTGTCTCCAAGCTACAAAGAAGAAGAAGAGAGGT AT ACAAAGAAAAC 
TACAAATGTTCACCATGAATGCTAGAAGAAGGGGAATAACAGATACTCTGCGTAGAAGAG 
ATTCCATATAAACCGGTAATATCCTGCTATAGCTTCCTTTGTGTAGTTTGCTTTTTCTAG 
CACCCATGTCTGGAAAACCAAGCATGAAGCCAAGATCATATGTGCAGGAATCATCAAGCT 
ACCTCTAAAAACCTGAGGCATGTAGAAAGCTAGTGATATGGCAGAAATATAGTTCACTAG 
55 CAGAAGTCCAGAACCGAGGAATGCAATGTTCCTCACTCCAAGCTTTGTTGCTAGTGTTGA 
TATTTGGAACTTGCGATCTCCTTCAACATCAGGAAGATCTTTTGTAATAGCAATGACTAG 
TG C AAA C AG T G T CAC AAAAG A C G T GAT G AAAG C CAC AG G T G C A C T C CAC T GAAAC G AAAG 
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TCCAAGAGCAGCTCTAGTAGCATGGTACACACCAAAATTAAGAAGAAAACCTCGTACCGT 
GGCAATAATAAGAAACGCTGCAACTGGAAATCTCTTCATTCTAAATGGTGGAACAGAATA 
GATGGTCCCCAGATCGGACGCGTGGGTCGAC 

5 <210> 104 
<211> 
<212> DNA 
<213> corn 



10 CCACGCGTCCGCCCGGCCAAGGGATGGACGCGCTTCGCCTACGGCCGTCCCTCCTCCCCG 
TGCGGCCCGGCGCGGCCCGCCCGCGAGATCATTTTCTACCACCATGTTGTTCCATACAAC 
GAAATGGTGAAGGACGAATTTGCTTTTCTAGCCAAAGGACCCAAGGTCCTACCTTGCATC 
ACCATCAGAAATTCTTCGAATGGAAATCCTCCTATTGTAGGATATCACATCGGTCATTAA 
ATACTTCTGTTAATGCTTCGGGGCAACAGCTGCAGTCTGAACCTGAAACACATGATTCTA 
15 CAACCATCTGGAGGGCAATATCATCTTCTCTAGATGCATTTTACAGATTTTCCCGGCCAC 
ATACTGTCATAGGAACAGCATTAAGCATAGTCTCAGTTTCCCTTCTAGCTGTCCAGAGCT 
TGTCTGATATATCACCTTTGTTCCTCACTGGTTTGCTGGAGGCAGTGGTAGCTGCCCTTT 
TCATGAATATCTATATTGTTGGACTGAACCAGTTATTCGACATTGAGATAGACAAGGTTA 
ACAAGCCAACTCTTCCATTGGCATCTGGGGAATACACCCTTGCAACTGGGGTTGCAATAG 
2 0 TTTCGGTCTTTGCCGCTATGAGCTTTGGCCTTGGATGGGCTGTTGGATCACAACCTCTGT 
^ TTTGGGCTCTTTTCATAAGCTTTGTTCTTGGGACTGCATATTCAATCAATCTGCCGTACC 
: ;f TTCGATGGAAGAGATTTGCTGTTGTTGCAGCACTGTGCATATTAGCAGTTCGTGCAGTGA 
: y TTGTTCAGCTGGCCTTTTTTCTCCACATTCAGACTTTTGTTTTCAGGAGACCGGCAGTGT 
Iff TTTCTAGGCCATTATTATTTGCAACTGGATTTATGACGTTCTTCTCTGTTGTAATAGCAC 
TATTCAAGGATATACCTGACATCGAAGGGGACCGCATATTCGGGATCCGATCCTTCAGCG 
:2 TCCGGTTAGGGCAAAAGAAGGTCTTTTGGATCTGCGTTGGCTTGCTTGAGATGGCCTACA 
;f GCGTTGCGATACTGATGGGAGCTACCTCTTCCTGTTTGTGGAGCAAAACAGCAACCATCG 
I! CTGGCCATTCCATACTTGCCGCGATCCTATGGAGCTGCGCGCGATCGGTGGACTTGACGA 
'•f$ GCAAAGCCGCAATAACGTCCTTCTACATGTTCATCTGGAAGCTGTTCTACGCGGAGTACC 
: 30 TGCTCATCCCTCTGGTGCGGTGAGCGCGAGGCGAGGTGGTGGCAGACGGATCGGCGTCGG 
= y CGGGGCGGCAAACAACTCCACGGGAGAACTTGAGTGCCGGAAGTAAACTCCCGTTTGAAA 
GTTGAAGCGTGCACCACCGGCACCGGGCAGAGAGAGACACGGTGGCTGGATGGATACGGA 
t & TGGCCCCCCCAATAAATTCCCCCGTGCATGGTAAAAAAAAAAAAAAAAAAA 

: S5 <210> 105 

*• 

i >r** 

r <211> 

;1 <212> DNA 

:J <213> corn 



40 GCCGCGCAGCGCGACGAGCGCCACCTGCTTGCTGCCGCGTGCCTGCGTGCGTGTGCGTCC 
ACCACTGACCCCGCGCCCGCCCGCCGCCCCTGCCCCTCCACTCCACTTGCTCACTCGTCG 
CGGCCCGCTTCCCCCCCGGCCAAGGGATGGACGCGCTTCGCCTACGGCCGTCCCTCCTCC 
CCGTGCGGCCCGGCGCGGCCCGCCCGCGAGATCATTTTCTACCACCATGTTGTTCCATAC 
AACGAAATGGTGAAGGACGAATTTGCTTTTCTAGCCAAAGGACCCAAGGTCCTACCTTGC 

45 ATCACCATCAGAAATTCTTCGAATGGAAATCCTCCTATTGTAGGATAT CACATCGGTCAT 
TAAATACTTCTGTTAATGCTTCGGGGCAACAGCTGCAGTCTGAACCTGAAACACATGATT 
CTACAACCATCTGGAGGGCAATAT CATCTTCTCTAGATGCATTTTACAGATTTTCCCGGC 
CACATACTGTCATAGGAACAGCATTAAGCATAGTCTCAGTTTCCCTTCTAGCTGTCCAGA 
GCTTGTCTGATATAT CACCTTTGTTCCTCACTGGTTTGCTGGAGGCAGTGGTAGCTGCCC 

50 TTTTCATGAATATCTATATTGTTGGACTGAACCAGTTATTCGACATTGAGATAGACAAGG 
TTAACAAGCCAACTCTTCCATTGGCATCTGGGGAATACACCCTTGCAACTGGGGTTGCAA 
TAGTTTCGGTCTTTGCCGCTATGAGCTTTGGCCTTGGATGGGCTGTTGGATCACAACCTC 
TGTTTTGGGCTCTTTTCATAAGCTTTGTTCTTGGGACTGCATATTCAAT CAATCTGCCGT 

ACCTTCGATGGAAGAGATTTGCTGTTGTTGCAGCACTGTGCATATTAGCAGTTCGTGCAG 
55 TGATTGTTCAGCTGGCCTTTTTTCTCCACATTCAGACTTTTGTTTTCAGGAGACCGGCAG 
TGTTTTCTAGGCCATTATTATTTGCAACTGGATTTATGACGTTCTTCTCTGTTGTAATAG 
CACTATTCAAGGATATACCTGACATCGAAGGGGACCGCATATTCGGGATCCGATCCTTCA 
GCGTCCGGTTAGGGCAAAAGAAGGTCTTTTGGATCTGCGTTGGCTTGCTTGAGATGGCCT 
ACAGCGTTGCGATACTGATGGGAGCTACCTCTTCCTGTTTGTGGAGCAAAACAGCAACCA 
60 TCGCTGGCCATTCCATACTTGCCGCGATCCTATGGAGCTGCGCGCGATCGGTGGACTTGA 
CGAGCAAAGCCGCAATAACGTCCTTCTACATGTTCATCTGGAAGCTGTTCTACGCGGAGT 
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ACCTGCTCATCCCTCTGGTGCGGTGAGCGCGAGGCGAGGTGGTGGCAGACGGATCGGCGT 

CGGCGGGGCGGCAAACAACTCCACGGGAGAACTTGAGTGCCGGAAGTAAACTCCCGTTTG 

AAAGTTGAAGCGTGCACCACCGGCACCGGGCAGAGAGAGACACGGTGGCTGGATGGATAC 

GGATGGCCCCCCCAATAAATTCCCCCGTGCATGGTACCCCACGCTGCTTGATGATATCCC 

5 ATGTGTCCGGGTGACCGGACCTGATCGTCTCTAGAGAGATTGGTTGCACAACGTCCAACA 

TAGCCCGTAGGTATTGCTACCACTGCTAGTATGATACTCCTTCCTAGTCCTTGCCAGCAC 

CAGTGACCCAAACTTGGTCGGCTGAGCTCAGCGCTCAGCAGCTTTACGTGCATCTGCGCC 

TTGACTTGTGCAGTGGGCGTCGCTAGCATGAATGATGTATGGTGCGTCACGGCCTGACGG 

TTCGTCAGTCTGGGCCGTGTTTTGTGTCCGAGGAAGATCGTCTGTCAGAGATCTGGATTG 
10 CCTCGCTGCT 

<210> 106 
<211> 
<212> DNA 

15 <213> corn 



CGGCCGGACTCTTCTGACTTGGCAACCGCCGCGCAGCGCGACGAGCGCCACCTGCTTGCT 
GCCGCGTGCCTGCGTGCGTGTGCGTCCACCACTGACCCCGCGCCCGCCCGCCGCCCCTGC 
CCCTCCACTCCACTTGCTCACTCGTCGGCTCGTCGCGGCCCGCTTCCCCCCCGGCCAAGG 
2 0 GATGGACGCGCTTCGCCTACGGCCGTCCCTCCTCCCCGTGCGGCCCGGCGCGGCCCGCCC 
= GCGAGGCAGTGGTAGCTGCCCTTTTCATGAATATCTATATTGTTGGACTGAACCAGTTAT 
:; :f TCGACATTGAGATAGACAAGGTTAACAAGCCAACTCTTCCATTGGCATCTGGGGAATACA 
CCCTTGCAACTGGGGTTGCAATAGTTTCGGTCTTTGCCGCTATGAGCTTTGGCCTTGGAT 
jjl GGGCTGTTGGATCACAACCTCTGTTTTGGGCTCTTTTCATAAGCTTTGTTCTTGGGACTG 
;=35 CATATTCAAT CAATCTGCCGTACCTTCGATGGAAGAGATTTGCTGTTGTTGCAGCACTGT 
;;Jf GCATATTAGCAGTTCGTGCAGTGATTGTTCAGCTGGCCTTTTTTCTCCACATTCAGACTT 
j.y TTGTTTTCAGGAGACCGGCAGTGTTTTCTAGGCCATTATTAT 

m <210> 107 

; Jo <2ii> 

; : <212> DNA 

<213> cotton 



J™ CCCACGCGTCCGAACATTGTTTGCACTTGTTATTGCCATAACCAAGGATCTTCCAGATGT 
, : B;5 AGAAGGAGATCGCAAATTTCAAATATCAACATTAGCAACAAAGCTTGGAGTTAGAAATAT 

TGCATTTCTTGGTTCCGGACTTCTACTGGTGAATTATGTTGCTGCTGTGTTGGCTGCAAT 
O . ATACATGCCTCAGGCTTTCAGGCGTAGTTTAATGATACCTGCTCATATCTTTTTGGCGGT 

CTGCTTGATTTTTCAGACATGGGTGTTGGAACAAGCAAATTACAAAAAGGAAGCAATCTC 

GGGGTTCTATCGTTTCATATGGAATCTCTTCTATGCAGAGTATGCGATTTTCCCCTTCGT 
4 0 GT 

<210> 108 
<211> 
<212> DNA 
45 <213> tomato 



CAGATCAATTCCAGTTCCTGCTGAGTTTTCTCCACTCAAAACCAGTTCACATGCAATAGT 
ACGGGTTTTGAAATGTAAAGCATGGAAGAGACCAAAAAAGCACTATTCCTCTTCAATGAA 
GTTGCAGCGGCAGTATATCACGCAAGAGCATGTTGGAGGAAGTGATCTAAGCACTATTGC 

50 TGCTGATAAAAAACTTAAAGGGAGATTTTTGGTGCACGCATCATCTGAACACCCTCTTGA 
ATCTCAACCTTCTAAAAGTCCTTGGGACTCAGTTAATGATGCCGTAGATGCTTTCTACAG 
GTTCTCGCGGCCCCATACCATAATAGGAACAGCATTGAGCATAATTTCAGTTTCTCTCCT 
TGCAGTTGAGAAGTTCTCTGATTTTTCTCCATTATTTTTCACTGGGGTGTTAGAGGCCAT 
TGTTGCTGCCCTATTCATGAACATTTACATAGTTGGTTTAAACCAGTTGTCTGACATCGA 

55 AATAGACAAGGTAAACAAGCCATATCTTCCATTGGCATCAGGGGAATACTCTGTACAAAC 
TGGAGTGATTGTTGTGTCGTCTTTTGCCATTTTGA 

<210> 109 
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<211> 

<212> DNA 

<213> Arabidopsis 

5 AACACCAAACACACAATTTCACATTCTTTTGCATATTTCTTCTTCTTCTTCCATTATGGA 
GATACGGAGCTTGATTGTTTCTATGAACCCTAATTTATCTTCCTTTGAGCTCTCTCGCCC 
TGTATCTCCTCTCACTCGCTCACTAGTTCCGTTCCGATCGACTAAACTAGTTCCCCGCTC 
CATTTCTAGGGGGATCCCGTCGATCTCCACCCCGAATAGTGAAACTGACAAGATCTCCGT 
TAAACCTGTTTACGTCCCGACGTCTCCCAATCGCGAACTCCGGACTCCTCACAGTGGATA 
10 CCATTTCGATGGAACACCTCGGAAGTTCTTCGAGGGATGGTGGATCCGGGTTTCCATCCC 
AGAGAAGAGGGAGAGTTTTTGTTTTATGTATTCTGTGGAGAATCCTGCATTTCGGCAGAG 
TTTGTCACCATTGGAAGTGGCTCTATATGGACCTAGATTCACTGGTGTTGGAGCTCAGAT 
TCTTGGCGCTAATGATAAATATTTATGCCAATACGAACAAGACTCTCACAATTTCTGGGG 
AGATCGACATGAGCTAGTTTTGGGGAATACTTTTAGTGCTGTGCCAGGCGCAAAGGCTCC 
1 5 AAACAAGGAGGTTCCACCAGAGGAATTTAACAGAAGAGTGTCCGAAGGGTTCCAAGCTAC 
TCCATTTTGGCATCAAGGTCACATTTGCGATGATGGCCGTACTGACTATGCGGAAACTGT 
GAAATCTGCTCGTTGGGAGTATAGTACTCGTCCCGTTTACGGTTGGGGTGATGTTGGGGC 
O CAAACAGAAGTCAACTGCAGGCTGGCCTGCAGCTTTTCCTGTATTTGAGCCTCATTGGCA 
;;3 GATATGCATGGCAGGAGGCCTTTCCACAGGGTGGATAGAATGGGGCGGTGAAAGGTTTGA ' 
;|0 GTTTCGGGATGCACCTTCTTATTCAGAGAAGAATTGGGGTGGAGGCTTCCCAAGAAAATG 
iij GTTTTGGGTCCAGTGTAATGTCTTTGAAGGGGCAACTGGAGAAGTTGCTTTAACCGCAGG 
jAj TGGCGGGTTGAGGCAATTGCCTGGATTGACTGAGACCTATGAAAATGCTGCACTGGTTTG 
;;=; TGTACACTATGATGGAAAAATGTACGAGTTTGTTCCTTGGAATGGTGTTGTTAGATGGGA 
!: JJ AATGTCTCCCTGGGGTTATTGGTATATAACTGCAGAGAACGAAAACCATGTGGTGGAACT 
'§5 AGAGGCAAGAACAAATGAAGCGGGTACACCTCTGCGTGCTCCTACCACAGAAGTTGGGCT 
; ; s AGCTACGGCTTGCAGAGATAGTTGTTACGGTGAATTGAAGTTGCAGATATGGGAACGGCT 
ATATGATGGAAGTAAAGGCAAGGTGATATTAGAGACAAAGAGCTCAATGGCAGCAGTGGA 
;=f GATAGGAGGAGGACCGTGGTTTGGGACATGGAAAGGAGATACGAGCAACACGCCCGAGCT 
''" ACTAAAACAGGCTCTTCAGGTCCCATTGGATCTTGAAAGCGCCTTAGGTTTGGTCCCTTT 
;;50 CTTCAAGCCACCGGGTCTGTAACATTGATGAGTGTTTTGTTTGTTGATAGAGACCCATGT 
GATGAATGAAGCCTTAGTCATGTCATTGCTAGCTTCACTATTATGTATGTATGATTTTAG 
W TTCGTTCGGTCCTTGTGGTAAATGATACGGGCCAGTGTAAAGT 

<210> 110 
35 <211> 

<212> PRT 

<213> Arabidopsis 

MEIRSLIVSMNPNLSSFELSRPVSPLTRSLVPFRSTKLVPRSISRVSASI 
40 STPNSETDKISVKPVYVPTSPNRELRTPHSGYHFDGTPRKFFEGWYFRVS 
IPEKRESFCFMYSVENPAFRQSLSPLEVALYGPRFTGVGAQILGANDKYL 
CQYEQDSHNFWGDRHELVLGNTFSAVPGAKAPNKEVPPEEFNRRVSEGFQ 
ATPFWHQGHICDDGRTDYAETVKSARWEYSTRPVYGWGDVGAKQKSTAGW 
PAAFPVFEPHWQICMAGGLSTGWIEWGGERFEFRDAPSYSEKNWGGGFPR 
4 5 KWFWVQCNVFEGATGEVALTAGGGLRQLPGLTETYENAALVCVHYDGKMY 
EFVPWNGVVRWEMSPWGYWYITAENENHVVELEARTNEAGTPLRAPTTEV 
GLATACRDSCYGELKLQIWERLYDGSKGKVILETKSSMAAVEIGGGPWFG 
TWKGDTSNTPELLKQALQVPLDLESALGLVPFFKPPGL* 
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